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HUMAN TRANSPORT PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human transport proteins 
and to the use of these sequences in the diagnosis, treatment, and prevention of transport, metabolic, 
neurological, cardiovascular, reproductive, and immune disorders, and cell proliferative disorders 
including cancer. 

BACKGROUND OF THE INVENTION 

Eukaryotic cells are surrounded and subdivided into functionally distinct organelles by 
hydrophobic lipid bilayer membranes. These membranes act as a barrier to most molecules, and 
maintain the essential differences between the cytosol, the extracellular environment, and the contents 
of each intracellular organelle. Transport of essential nutrients, certain metal ions, metabolic waste 
products, cell signaling molecules, macromolecules, and proteins across lipid membranes and 
between organelles must be mediated by a variety of transport molecules. Transport between the 
cytoplasm and the extracellular environment, and between the cytoplasm and lumenal spaces of 
cellular organelles requires specific transport proteins. Each transport protein carries a particular 
class of molecule, such as ions, sugars, or amino acids, and often is specific to a certain molecular 
species of the class. 

Cells and organelles require transport molecules to import and export essential nutrients and 
metal ions including K\ NH 4 + , P { , S0 4 2 ", sugars, and vitamins, as well as various metabolic waste 
products. Transport proteins also play roles in antibiotic resistance, toxin secretion, ion balance, 
synaptic neurotransmission, kidney function, intestinal absorption, tumor growth, and other diverse 
cell functions (Griffith, J. and C. Sansom (1998) The Transporter Facts Book . Academic Press, San 
Diego CA, pp. 3-29). Transport can occur by a passive concentration-dependent mechanism, or can 
be linked to an energy source such as ATP hydrolysis or an ion gradient. Proteins that function in 
transport include carrier proteins, which bind to a specific solute and undergo a conformational 
change that transfers the bound solute across the membrane, and channel proteins, which form 
hydrophilic pores that allow specific solutes to diffuse through the membrane down an 
electrochemical solute gradient. 

Transport proteins are often multi-pass transmembrane proteins, which either actively 
transport molecules across the membrane or passively allow them to cross. Active transport involves 
directional pumping of a solute across the membrane, usually against an electrochemical gradient. 
Active transport is tightly coupled to a source of metabolic energy, such as ATP hydrolysis or an 
electrochemically favorable ion gradient. Passive transport involves the movement of a solute down 
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its electrochemical gradient. Transport proteins can be further classified as either carrier proteins or 
channel proteins. Carrier proteins, which can function in active or passive transport, bind to a 
specific solute to be transported and undergo a conformational change which transfers the bound 
solute across the membrane. Channel proteins, which only function in passive transport, form 
hydrophilic pores across the membrane. When the pores open, specific solutes, such as inorganic 
ions, pass through the membrane and down the electrochemical gradient of the solute. Examples 
include facilitative transporters, the secondary active symporters and antiporters driven by ion 
gradients, and active ATP binding cassette transporters involved in multiple-drug resistance and 
targeting of antigenic peptides to MHC Class I molecules. Transported substrates range from 
nutrients and ions to a broad variety of drugs, peptides and proteins. 

Information on the action of ARL-6 (ADP-ribosylation like factor), an endoplasmic reticulum 
transmembrane protein, can be found in Greenfield, J.J. and S. High (1999; J. Cell Sci. 1 12:1477- 
1486). Information on reduced folate carrier transporter proteins can be found in Dixon, K.H. et al. 
(1994; J. Biol. Chem. 269:17-20) and Moscow, J.A. et al. (1995; Cancer Res. 55:5983-5987). 

Carrier proteins which transport a single solute from one side of the membrane to the other 
are called uniporters. In contrast, coupled transporters link the transfer of one solute with 
simultaneous or sequential transfer of a second solute, either in the same direction (symport) or in the 
opposite direction (amiport). For example, intestinal and kidney epithelia contain a variety of 
symporter systems driven by the sodium gradient that exists across the plasma membrane. Sodium 
moves into the cell down its electrochemical gradient and brings the solute into the cell with it. The 
sodium gradient that provides the driving force for solute uptake is maintained by the ubiquitous 
Na7K + ATPase. Sodium-coupled transporters include the mammalian glucose transporter (SGLT1), 
iodide transporter (NIS), and multivitamin transporter (SMVT). All three transporters have twelve 
putative transmembrane segments, extracellular glycosylation sites, and cytoplasmically-oriented N- 
and C-termini. 

Mitochondrial carrier proteins are transmembrane-spanning proteins which transport ions and 
charged metabolites between the cytosol and the mitochondrial matrix. Examples include the ADP, 
ATP carrier protein; the 2-oxoglutarate/malate carrier; the phosphate carrier protein; the brown fat 
uncoupling protein which transports protons from the cytosol into the matrix; the pyruvate carrier; the 
dicarboxylate carrier which transports malate, succinate, fumarate, and phosphate; the tricarboxylate 
carrier which transports citrate and malate; and the Grave's disease carrier protein, a protein 
recognized by IgG in patients with active Grave's disease, an autoimmune disorder resulting in 
hyperthyroidism (Stryer, L. (1995) Biochemistry . W.H. Freeman and Company, New York NY, p. 
55 1 ; PROSITE PDOC001 89 Mitochondrial energy transfer proteins signature; Online Mendelian 
Inheritance in Man (OMIM) *275000 Graves Disease). 

This class of transporters also includes the mitochondrial uncoupling proteins, which create 
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proton leaks across the inner mitochondrial membrane, thus uncoupling oxidative phosphorylation 
from ATP synthesis. The result is energy dissipation in the form of heat. Mitochondrial uncoupling 
proteins have been implicated as modulators of thermoregulation and metabolic rate, and have been 
proposed as potential targets for drugs against metabolic diseases such as obesity (Ricquier, D. et al. 
5 (1999) J. Int. Med. 245:637-642). 

A number of metal ions such as iron, zinc, copper, cobalt, manganese, molybdenum, 
selenium, nickel, and chromium are important as cofactors for a number of enzymes. For example, 
zinc is required for the function of enzymes such as the extracellular matrix metalloproteinases, and 
zinc ions stabilize several motifs commonly found in transcription factors, including zinc fingers, zinc 
10 clusters, and LIM domains. Zinc and other metal ions must be provided in the diet, and are absorbed 
by transporters in the gastrointestinal tract. Plasma proteins transport the metal ions to the liver and 
other target organs, where specific transporters move the ions into cells and cellular organelles as 
needed. Imbalances in metal ion metabolism have been associated with a number of disease states 
(Danks, D.M. (1986) J. Med. Genet. 23:99-106). 

15 The largest and most diverse family of transport proteins known are the ATP-binding cassette 

(ABC) transporters. As a family, ABC transporters can transport substances that differ markedly in 
chemical structure and size, ranging from small molecules such as ions, sugars, amino acids, peptides, 
and phospholipids, to lipopeptides, large proteins, and complex hydrophobic drugs. ABC proteins 
consist of four modules: two nucleotide-binding domains (NBD), which hydrolyze ATP to supply the 

20 energy required for transport, and two membrane-spanning domains (MSD), each containing six 
putative transmembrane segments. These four modules may be encoded by a single gene, as is the 
case for the cystic fibrosis transmembrane regulator (CFTR), or by separate genes. When encoded by 
separate genes, each gene product contains a single NBD and MSD. These "half-molecules" form 
homo- and heterodimers, such as Tapl and Tap2, the endoplasmic reticulum-based major 

25 histocompatibility (MHC) peptide transport system. Several genetic diseases are attributed to defects 
in ABC transporters, such as the following diseases and their corresponding proteins: cystic fibrosis 
(CFTR, an ion channel), adrenoleukodystrophy (adrenoleukodystrophy protein, ALDP), Zellweger 
syndrome (peroxisomal membrane protein-70, PMP70), and hyperinsulinemic hypoglycemia 
(sulfonylurea receptor, SUR). Overexpression of the multidrug resistance (MDR) protein, another 

30 ABC transporter, in human cancer cells makes the cells resistant to a variety of cytotoxic drugs used 
in chemotherapy (Taglicht, D. and S. Michaelis (1998) Methods Enzymol. 292:131-163). 

The nuclear pore complex (NPC) is a large multiprotein complex spanning the nuclear 
envelope which mediates the transport of proteins and RNA molecules between the nucleus and the 
cytoplasm, thus contributing to the regulation of gene expression. The NPC allows passive diffusion 

35 of ions, small molecules, and macromolecules under about 60kD, while larger macromolecules are 
transported by facilitated, energy-dependent pathways. Nuclear localization signals (NLS), consisting 
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of short stretches of amino acids enriched in basic residues, are found on proteins that are targeted to 
the nucleus, such as the glucocorticoid receptor. The NLS is recognized by the NLS receptor, 
importin, which then interacts with the monomeric GTP-binding protein Ran. This NLS 
protein/receptor/Ran complex navigates the nuclear pore with the help of the homodimeric protein 
nuclear transport factor 2 (NTF2) (Nakielny, S. and G. Dreyfuss (1997) Curr. Opin. Cell Biol. 9:420- 
429; Gorlich, D. (1997) Curr. Opin. Cell Biol. 9:412-419). Four O-linked glycoproteins, p62, p58, 
p54, and p45, exist as a stable **p62 complex" that forms a ring localized on both nucleoplasm^ and 
cytoplasmic surfaces of the NPC. The p62, p58, and p54 proteins all interact directly with the 
cytosolic transport factors p97 and NTF2, suggesting that the p62 complex is an important ligand 
binding site near the central gated channel of the NPC (Hu, T. et al. (1996) J. Cell Biol. 134:589-601). 

Transport can also occur through intercellular bridges which connect the cytoplasms of sister 
cells, for example in the male and female germline of species ranging from fruit flies to humans. 
These bridges allow passage of cytoplasmic materials between cells during development. 
Intercellular bridges have also been found to connect somatic cells. The nurse cells and oocyte of a 
Drosophila egg chamber, which are derived from a single precursor cell through four rounds of 
mitosis, are connected to each other through intercellular bridges called ring canals. The cells do not 
completely separate after mitosis; the mitotic cleavage furrows are transformed into ring canals by the 
addition of an actin cytoskeleton lining the tunnels between the cells. The Drosophila kelch protein 
functions in organizing actin in the ring canal. Mutations in kelch cause female sterility in 
Drosophila. Kelch contains four protein domains: the NTR domain at the N-terminus, the BTB or 
POZ domain, the IVR or intervening region; and the kelch repeat domain, which contains six 50- 
amino acid kelch repeats. The BTB or POZ domain, a 120-amino acid motif that is also found in 
several zinc-finger containing transcription factors, may be important in dimerization of kelch. Kelch 
repeats are found in other proteins as well and may be important for actin binding (Robinson, D.N. 
and L. Cooley (1997) J. Cell Biol. 138:799-810; Cooley, L. (1998) Cell 93:913-915). 

Ion Channels 

The electrical potential of a cell is generated and maintained by controlling the movement of 
ions across the plasma membrane. The movement of ions requires ion channels, which form an ion- 
selective pore within the membrane. Ion channels share common structural and mechanistic themes. 
The channel consists of four or five subunits or protein monomers that are arranged like a barrel in 
the plasma membrane. Each subunit typically consists of six potential transmembrane segments (SI, 
S2, S3, S4, S5, and S6). The center of the barrel forms a pore lined by ct-helices or P-strands. The 
side chains of the amino acid residues comprising the a-helices or P-strands establish the charge 
(cation or anion) selectivity of the channel. The degree of selectivity, or what specific ions are 
allowed to pass through the channel depends on the diameter of the narrowest part of the pore. There 
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are two basic types of ion channels, ion transporters and gated ion channels. Ion transporters utilize 
the energy obtained from ATP hydrolysis to actively transport an ion against the ion's concentration 



under restricted conditions. Together, these types of ion channels generate, maintain, and utilize an 

5 electrochemical gradient that is used in 1) electrical impulse conduction down the axon of a nerve 
cell, 2) transport of molecules into cells against concentration gradients, 3) initiation of muscle 
contraction, and 4) endocrine cell secretion. 

Transmembrane ATPases are divided into three families. The phosphorylated (P) class ion 
transporters, including Na + -K + ATPase, Ca 2+ ATPase, H + ATPase, and Cu~ ATPase, are activated by 

10 a phosphorylation event. P-class ion transporters are responsible for maintaining resting potential 
distributions such that cytosolic concentrations of Na + and Ca 2+ are low and cytosolic concentration 
of K + is high. The vacuolar (V) class of ion transporters include H + pumps on intracellular 
organelles, such as lysosomes and Golgi. V-ciass ion transporters are responsible for generating the 
low pH within the lumen of these organelles that is required for function. The coupling factor (F) 

15 class consists of H + pumps in the mitochondria. F-class ion transporters utilize a proton gradient to 
generate ATP from ADP and inorganic phosphate (P 4 ). 

Cu~ ATPases export copper from cells (PROS1TE PDOC00139 E1-E2 ATPases 
phosphorylation site). Mutations in one Cu~ ATPase cause Wilson disease, in which toxic amounts 
of copper accumulate in a number of organs, particularly the liver and brain (Tanzi, R.E. et al. (1993) 

20 Nat. Genet. 5:344-350). Mutations in another Cu^ ATPase cause Menkes disease and occipital horn 
syndrome. Menkes disease mutations block export of copper from the gastrointestinal tract, leading 
to skeletal abnormalities, severe mental retardation, neurologic degeneration, and mortality in early 
childhood (Harrison, M.D. and C.T. Dameron (1999) J. Biochem. Mol. Toxicol. 13:93-106). 
Occipital horn syndrome mutations cause connective tissue defects (Harrison, supra; Levinson, B. et 

25 al. (1996) Hum. Moi. Genet. 5:1737-1742). 

The coupling factor (F) class of ion transporters consists of H + pumps in mitochondria, 
chloroplasts, and bacteria. For example, the FoF, ATPase utilizes a proton gradient across the inner 
mitochondrial membrane to generate ATP from ADP and inorganic phosphate (P^. The FoF, ATPase 
is composed of the F 0 complex, which is the transmembrane channel through which protons flow, and 

30 the F, complex, where ATP synthesis activity resides. F 0 has three subunits, A (also known as protein 
6), B, and C (Lodish, H. et al. ( 1995) Molecular Cell Biology , Scientific American Books, New York 
NY, pp. 752-756; PROSITE PDOC00420 ATP synthase a subunit signature). 

Voltage-gated Ca 2+ channels are involved in presynaptic neurotransmitter release, and heart 
and skeletal muscle contraction. The voltage-gated Ca 2+ channels from skeletal muscle (L-type) and 

35 brain (N-type) have been purified and, though their functions differ dramatically, they have similar 
subunit compositions. The channels are composed of three subunits. The a t subunit forms the 



gradient. Gated ion channels allow passive flow of an ion down the ion's electrochemical gradient 
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membrane pore and voltage sensor, while the o>8 and J3 subunits modulate the voltage-dependence, 
gating properties, and the current amplitude of the channel. These subunits are encoded by at least 
six a„ one a^S, and four p genes. A fourth subunit, y, has been identified in skeletal muscle (Walker, 
D. et al. (1998) J. Biol. Chem. 273:2361-2367; and Jay, S.D. et al. (1990) Science 248:490-492). The 
5 human (34 subunit is homologous to the mouse epilepsy gene lethargic , and is a candidate for 

involvement in neurological disorders including ataxia and absence epilepsy (Escayg, A. et al. (1998) 
Genomics 50:14-22), 

Ligand-gated channels open their pores when an extracellular or intracellular mediator binds 
to the channel. Neurotransmitter-gated channels are channels that open when a neurotransmitter 

10 binds to their extracellular domain. These channels exist in the postsynaptic membrane of nerve or 
muscle cells. There are two types of neurotransmitter-gated channels. Sodium channels open in 
response to excitatory neurotransmitters, such as acetylcholine, glutamate, and serotonin. This 
opening causes an influx of Na + and produces the initial localized depolarization that activates the 
voltage-gated channels and starts the action potential. Chloride channels open in response to 

15 inhibitory neurotransmitters, such as y-aminobutyric acid (GABA) and glycine, leading to 
hyperpolarization of the membrane and the subsequent generation of an action potential. 

Ion channels are expressed in a number of tissues where they are implicated in a variety of 
processes. CNG channels, while abundantly expressed in photoreceptor and olfactory sensory cells, 
are also found in kidney, lung, pineal, retinal ganglion cells, testis, aorta, and brain. Calcium-activated 

20 K + channels may be responsible for the vasodilatory effects of bradykinin in the kidney and for 

shunting excess K + from brain capillary endothelial cells into the blood. They are also implicated in 
repolarizing granulocytes after agonist-stimulated depolarization (Ishi, T.M. et al. (1997) Proc. Natl. 
Acad. Sci. USA 94: 1 1651-1 1656). Another transmembrane protein, the leukotrine B4 receptor (BLT) 
appears to be involved in inflammation responses and host cell defense against infection. BLT also 

25 functions as an HIV coreceptor (Izumi, T. et al. (1997) Nature 387:620-624; Martin, V. et al. (1999) 
J. Biol. Chem. 274:8597-8603). 

Ion channels have been the target for many drug therapies. Neurotransmitter-gated channels 
have been targeted in therapies for treatment of insomnia, anxiety, depression, and schizophrenia. 
Voltage-gated channels have been targeted in therapies for arrhythmia, ischemic stroke, head trauma, 

30 and neurodegenerative disease (Taylor, CP. and L.S. Narasimhan (1997) Adv. Pharmacol. 39:47-98). 
K + channels are located in all cell types, and may be regulated by voltage, ATP 
concentration, or second messengers such as Ca** and cAMP. In non-excitable tissue, K + channels 
are involved in protein synthesis, control of endocrine secretions, and the maintenance of osmotic 
equilibrium across membranes. In neurons and other excitable cells, in addition to regulating action 

35 potentials and repolarizing membranes, K + channels are responsible for setting resting membrane 

potential. The cytosol contains non-diffusible anions and, to balance this net negative charge, the cell 
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contains a Na + -K + pump and ion channels that provide the redistribution of Na + , K + , and CI*. The 
pump actively transports Na + out of the cell and K + into the cell in a 3:2 ratio. Ion channels in the 
plasma membrane allow K + and CI to flow by passive diffusion. Because of the high negative charge 
within the cytosol, CI ' flows out of the cell. The flow of K* is balanced by an electromotive force 

5 pulling K + into the cell, and a K + concentration gradient pushing K + out of the cell. Thus, the resting 
membrane potential is primarily regulated by K + fIow (Salkoff, L. and T. Jegla (1995) Neuron 15:489- 
492). Information on NY-REN-45, a K+ channel integral membrane protein, can be found in 
Scanlan, M.J. et al. (1998; Int. J. Cancer 76:652-658). The emopamil-binding protein (EBP) shares 
structural features with both pro- and eukaryotic drug transport proteins (Hanner, M. et al. (1995) J. 

10 Biol. Chem. 270:7551-7557), The Na+ channel, transmembrane protein myelin protein zero (MPZ) 
may be responsible for some sporadic cases of Dejerine-Scottas disease (hereditary motor and 
sensory neuropathy type III) (Hayasaka, K. et al. (1993) Nat. Genet. 5:266-268). 

K + pore-forming subunits generally have six transmembrane-spanning domains with a short 
region between the fifth and sixth transmembrane regions that senses membrane potential; and the 

15 amino and carboxy termini are located intracellularly. In mammalian heart, the duration of 
ventricular action potential is controlled by a K + current. Thus, the K + channel is central to the 
control of heart rate and rhythm. K + channel dysfunctions are associated with a number of renal 
diseases including hypertension, hypokalemia, and the associated Bartter's syndrome and Getelman's 
syndrome, as well as neurological disorders including epilepsy. K + channels have been implicated in 

20 Alzheimer's disease by observations that a significant component of senile plaques, beta amyloid or 
A beta, also blocks voltage-gated potassium channels in hippocampal neurons (Antes, L.M. et al. 
(1998) Seminar Nephrol. 18:31-45; Stoffel, M. and L.Y. Jan (1998) Nat. Genet. 18:6-8; Madeja, M. 
et al. (1997) Eur. J. Neurosci. 9:390-395; Good, T.A. et al. (1996) Biophys. J. 70:296-304). 

Gated ion channels control ion flow by regulating the opening and closing of pores. These 

25 channels are categorized according to the manner of regulating the gating function. Mechanically- 
gated channels open pores in response to mechanical stress, voltage-gated channels open pores in 
response to changes in membrane potential, and ligand-gated channels open pores in the presence of a 
specific ion, nucleotide, or neurotransmitter. 

Voltage-gated Na + channels are responsible for electrical excitability of neurons, skeletal 

30 muscle, heart, and neuroendocrine tissues. For example, the sequential opening and closing of 

voltage-gated Na + channels results in the propagation of action potentials down neuronal axons. Na + 
channels isolated from rat brain tissue are heterotrimeric complexes composed of a 260 kDa pore 
forming a subunit that associates with two smaller auxiliary subunits, pi and P2. The p2 subunit is 
an integral membrane glycoprotein that contains an extracellular Ig domain, and its association with a 

35 and pi subunits correlates with increased function of the channel, a change in the channel's gating 
properties, as well as an increase in whole cell capacitance (Isom, L.L. et al. (1995) Cell 83:433-442). 
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Integral Membrane Proteins 

The majority of known integral membrane proteins are transmembrane proteins (TM) which 
are characterized by an extracellular, a transmembrane, and an intracellular domain. TM domains are 
5 typically comprised of 15 to 25 hydrophobic amino acids which are predicted to adopt an a-helical 
conformation. TM proteins are classified as bitopic (Types I and H) and polytopic (Types III and IV) 
(Singer, SJL (1990) Annu. Rev. Cell Biol. 6:247-96). Bitopic proteins span the membrane once while 
polytopic proteins contain multiple membrane-spanning segments. TM proteins that act as cell- 
surface receptor proteins involved in signal transduction include growth and differentiation factor 
10 receptors, and receptor-interacting proteins such as Drosophila pecanex and frizzled proteins, LIV-1 
protein, NF2 protein, and GNSI/SUR4 eukaryotic integral membrane proteins. TM proteins also act 
as transporters of ions or metabolites, such as gap junction channels (connexins) and ion channels, 
and as cell anchoring proteins, such as lectins, integrins, and fibronectins. TM proteins act as vesicle 
organelle-forming molecules, such as calveolins, or as cell recognition molecules, such as cluster of 
15 differentiation (CD) antigens, glycoproteins, and mucins. Information on connexin can be found in 
Kanter, H.L. et al. (1994; J. Mol. Cell. Cardiol. 26:861-868). 

Many membrane proteins (MPs) contain amino acid sequence motifs that target these proteins 
to specific subcellular sites. Examples of these motifs include PDZ domains, KDEL, RGD, NGR, and 
GSL sequence motifs, von Willebrand factor A (vWFA) domains, and EGF-like domains. RGD, 
20 NGR, and GSL motif-containing peptides have been used as drug delivery agents in cancer treatments 
which target tumor vasculature (Arap, W. et al. (1998) Science, 279:377-380.) Furthermore, MPs 
may also contain amino acid sequence motifs, such as the carbohydrate recognition domain (CRD), 
also known as the C-type lectin domain, that mediate interactions with extracellular or intracellular 
molecules. 

25 G-protein coupled receptors (GPCR) comprise a superfamily of integral membrane proteins 

which transduce extracellular signals. GPCRs include receptors for biogenic amines, lipid mediators 
of inflammation, peptide hormones, and sensory signal mediators. The structure of these 
highly-conserved receptors consists of seven hydrophobic transmembrane regions, an extracellular 
N-terminus, and a cytoplasmic C-terminus. Three extracellular loops alternate with three intracellular 

30 loops to link the seven transmembrane regions. The most conserved parts of these proteins are the 
transmembrane regions and the first two cytoplasmic loops. Cysteine disulfide bridges connect the 
second and third extracellular loops. A conserved, acidic-Arg-aromatic residue triplet present in the 
second cytoplasmic loop may interact with G proteins. A GPCR consensus pattern is characteristic of 
most proteins belonging to this superfamily (ExPASy PROSITE document PS00237; and Watson, 

35 S. and S. Arkinstall (1994) The G-protein Linked Recep tor Facts Book, Academic Press, San Diego 
CA, pp 2-6). Mutations and changes in transcriptional activation of GPCR-encoding genes have been 
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associated with neurological disorders such as schizophrenia, Parkinson's disease, Alzheimer's 
disease, drug addiction, and feeding disorders. 

Cytochromes are electron-transferring proteins that contain a heme prosthetic group, a 
porphyrin ring containing a tightly bound iron atom. Cytochromes act as oxidoreductases in such 
5 diverse cellular processes as respiration, photosynthesis, fatty acid metabolism, and neurotransmitter 
biosynthesis. The heme iron atom serves as the actual electron carrier by changing from the ferric to 
the ferrous oxidation state when accepting an electron. Cytochromes accept electrons from one 
substrate such as NADH or ascorbate and donate them to other electron carriers such as other 
cytochromes, ubiquinone, or semidehydroascorbic acid (Lodish, H. et al. (1995) Molecular Cell 
10 Biology . Scientific American Books, New York NY, pp. 759-770, 786-797; Sperling, P. et al. (1995) 
Eur. J. Biochem. 232:798-805; and Online Mendelian Inheritance in Man (OMIM) *600019 
Cytochrome b561, CYB561). 

Cytochrome b5 is an electron donor in membrane-linked redox enzyme systems involved in 
lipid and drug metabolism. Cytochrome b5 has been found in Golgi, plasma, outer mitochondrial, 
15 endoplasmic reticulum (ER), and microbody membranes. Conserved amino acids in cytochrome b5 
include eight invariant amino acids at W34, H51, P52, G53, G54, G63, F70, and H74, and fifteen 
conserved amino acids at L24, 135, S36, V41, Y42, N43, T45, W47, A48, L58, D65, T67, L85, T87, 
and G88 (numbering based on the sunflower cytochrome b5/delta-6 desaturase fusion protein; GI 
1040729, Sperling, supra) . The invariant residues H51PGG are involved in heme-binding. 
20 Cytochrome b5-like domains have also been found linked to other enzymes. For example, 

cytochrome b5-like domains are part of delta-9 fatty acid desaturases in yeast and Histoplasma 
capsulatum . nitrate reductase, sulfite reductase, flavocytochrome b2, Arahidopsis thaliana acyl lipid 
desaturase, and Borago officinalis (borage) and Helianthus annuus (sunflower) delta-6 desaturases 
(Sperling, supra ; Sayanova, O. et al (1997) Proc. Natl. Acad. Sci. USA 94:421 1-4216; and Mitchell, 
25 A.G. and C.E. Martin (1997) J. Biol. Chem. 272:28281-28288). 

Signal peptides are found on proteins that are targeted to the endoplasmic reticulum (ER). 
Signal peptides consist of stretches of amino acids enriched in hydrophobic residues. Signal peptides 
are usually found at the extreme N-terminus of the protein and are recognized by a cytosolic signal- 
recognition peptide (SRP). The SRP binds to the signal peptide and to an SRP receptor, an integral 
30 membrane protein in the ER. Once bound to the SRP receptor, the newly formed protein containing 
the signal peptide is translocated across the ER membrane. Proteins containing signal peptides may 
end up inserted into the lipid bilayer, or they may end up in the lumen of an organelle or secreted from 
the cell. 

35 Disease Correlation 

The etiology of numerous human diseases and disorders can be attributed to defects in the 
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transport of molecules across membranes. Defects in the trafficking of membrane-bound transporters 
and ion channels are associated with several disorders, e.g. cystic fibrosis, glucose-galactose 
malabsorption syndrome, hypercholesterolemia, von Gierke disease, and certain forms of diabetes 
mellitus. Single-gene defect diseases resulting in an inability to transport small molecules across 
5 membranes include, e.g., cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease (vant Hoff, 
W.G. (1996) Exp. Nephrol. 4:253-262; Talente, G.M. et al. (1994) Ann. Intern. Med. 120:218-226; 
and Chillon, M. et al. (1995) New Engl. J. Med. 332: 1475-1480). 

Cystinuria is an inherited disease that results from the inability to transport cystine, the 
disulfide-linked dimer of cysteine, from the urine into the blood. Accumulation of cystine in the 
10 urine leads to the formation of cystine stones in the kidneys. 

Transthyretin (TTR), present in human plasma, binds to and transports the thyroid hormone 
thyroxine. Mutations in TTR result in the conversion of TTR to amyloid, an insoluble fibrillar 
structure. The resulting amyloid plaques have been shown to be the causative agent in the 
development of familial amyloid polyneuropathy and senile systemic amyloidosis (Miroy, G.J. et al. 
15 (1996) Proc. Natl. Acad. Sci. USA 93:15051-15056). 

Stomatin, a 31-kDa erythrocyte integral membrane protein has been linked to the hereditary 
anemia stomatocytosis. This anemia is characterized by red blood cells that lack stomatin and leak 
Na+ and K+. Thus, stomatin is presumed to play a role in the regulation of ion transport. Red blood 
cell ion transport defects are also linked to other disorders such as hypertension (Stewart, G.W. 
20 (1997) Int. J. Biochem. Cell Biol. 29:271-274). 

The discovery of new human transport proteins and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
prevention, and treatment of transport, metabolic, neurological, cardiovascular, reproductive, and 
25 immune disorders, and cell proliferative disorders including cancer. 

SUMMARY OF THE INVENTION 

The invention features purified polypeptides, human transport proteins, referred to 
collectively as "TPPT" and individually as 'TPPT-1 " "TPPT-2 "TPPT-3," "TPPT-4," 'TPPT-5," 

30 "TPPT-6," "TPPT-7," 'TPPT-8," "TPPT-9," 'TPPT-10," "TPPT-l 1," "TPPT-12 " 'TPPT-13/' 
"TPPT-14," "TPPT-15," 'TPPT-16," "TPPT-17," "TPPT-18," 'TPPT-19," "TPPT-20," "TPPT-21 
"TPPT-22," "TPPT-23," "TPPT-24," "TPPT-25" "TPPT-26," "TPPT-27," "TPPT-28," "TPPT-29," 
"TPPT-30," l TPPT-31," 'TPPT-32/' 'TPPT-33," "TPPT-34," 'TPPT-35/' 'TPPT-36/' "TPPT-37," 
"TPPT-38," *TPPT-39," 'TPPT-40," "TPPT-41 " "TPPT-42," and "TPPT-43." In one aspect, the 

35 invention provides an isolated polypeptide comprising an amino acid sequence selected from the 
group consisting of a) an amino acid sequence selected from the group consisting of SEQ ID NO:l- 
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43, b) a naturally occurring amino acid sequence having at least 90% sequence identity to an amino 
acid sequence selected from the group consisting of SEQ ED NO: 1-43, c) a biologically active 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, and d) an 
immunogenic fragment of an amino acid sequence selected from the group consisting of SEQ ID 

5 NO: 1-43. In one alternative, the invention provides an isolated polypeptide comprising the amino 
acid sequence of SEQ ID NO: 1-43. 

The invention further provides an isolated polynucleotide encoding a polypeptide comprising 
an amino acid sequence selected from the group consisting of a) an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-43, b) a naturally occurring amino acid sequence having at 

10 least 90% sequence identity to an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-43, c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1-43. In one alternative, the polynucleotide encodes a 
polypeptide selected from the group consisting of SEQ ID NO: 1-43. In another alternative, the 

15 polynucleotide is selected from the group consisting of SEQ ID NO:44-86. 

Additionally, the invention provides a recombinant polynucleotide comprising a promoter 
sequence operably linked to a polynucleotide encoding a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-43, b) a naturally occurring amino acid sequence having at least 90% 

20 sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, 
c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43. In one alternative, the invention provides a cell transformed 
with the recombinant polynucleotide. In another alternative, the invention provides a transgenic 

25 organism comprising the recombinant polynucleotide. 

The invention also provides a method for producing a polypeptide comprising an amino acid 
sequence selected from the group consisting of a) an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-43, b) a naturally occurring amino acid sequence having at least 90% 
sequence identity to an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, 

30 c) a biologically active fragment of an amino acid sequence selected from the group consisting of 
SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43. The method comprises a) culturing a cell under conditions 
suitable for expression of the polypeptide, wherein said cell is transformed with a recombinant 
polynucleotide comprising a promoter sequence operably linked to a polynucleotide encoding the 

35 polypeptide, and b) recovering the polypeptide so expressed. 

Additionally, the invention provides an isolated antibody which specifically binds to a 
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polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally occurring amino 
acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino acid sequence 

5 selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43. 

The invention further provides an isolated polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:44-86, b) a naturally occurring polynucleotide sequence having at least 70% 

10 sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:44- 
86, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary 
to b), and e) an RNA equivalent of a)-d). In one alternative, the polynucleotide comprises at least 60 
contiguous nucleotides. 

Additionally, the invention provides a method for detecting a target polynucleotide in a 

15 sample, said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 
consisting of SEQ ID NO:44-86, b) a naturally occurring polynucleotide sequence having at least 70% 
sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:44- 
86, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary 

20 to b), and e) an RNA equivalent of a)-d). The method comprises a) hybridizing the sample with a 
probe comprising at least 20 contiguous nucleotides comprising a sequence complementary to said 
target polynucleotide in the sample, and which probe specifically hybridizes to said target 
polynucleotide, under conditions whereby a hybridization complex is formed between said probe and 
said target polynucleotide or fragments thereof, and b) detecting the presence or absence of said 

25 hybridization complex, and optionally, if present, the amount thereof. In one alternative, the probe 
comprises at least 60 contiguous nucleotides. 

The invention further provides a method for detecting a target polynucleotide in a sample, 
said target polynucleotide having a sequence of a polynucleotide comprising a polynucleotide 
sequence selected from the group consisting of a) a polynucleotide sequence selected from the group 

30 consisting of SEQ ID NO.44-86, b) a naturally occurring polynucleotide sequence having at least 70% 
sequence identity to a polynucleotide sequence selected from the group consisting of SEQ ID NO:44- 
86, c) a polynucleotide sequence complementary to a), d) a polynucleotide sequence complementary 
to b), and e) an RNA equivalent of a)-d). The method comprises a) amplifying said target 
polynucleotide or fragment thereof using polymerase chain reaction amplification, and b) detecting 

35 the presence or absence of said amplified target polynucleotide or fragment thereof, and, optionally, if 
present, the amount thereof. 
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The invention further provides a pharmaceutical composition comprising an effective amount 
of a polypeptide comprising an amino acid sequence selected from the group consisting of a) an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally occurring 
amino acid sequence having at least 90% sequence identity to an amino acid sequence selected from 
5 the group consisting of SEQ ID NO:l-43, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, and a pharmaceutical^ 
acceptable excipient. In one embodiment, the pharmaceutical composition comprises an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-43. The invention additionally provides 
10 a method of treating a disease or condition associated with decreased expression of functional TPPT, 
comprising administering to a patient in need of such treatment the pharmaceutical composition. 

The invention also provides a method for screening a compound for effectiveness as an 
agonist of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally 
15 occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1 -43, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43. The 
method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
20 agonist activity in the sample. In one alternative, the invention provides a pharmaceutical 

composition comprising an agonist compound identified by the method and a pharmaceutical ly 
acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with decreased expression of functional TPPT, comprising administering to a 
patient in need of such treatment the pharmaceutical composition. 
25 Additionally, the invention provides a method for screening a compound for effectiveness as 

an antagonist of a polypeptide comprising an amino acid sequence selected from the group consisting 
of a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally 
occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino 
30 acid sequence selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43. The 
method comprises a) exposing a sample comprising the polypeptide to a compound, and b) detecting 
antagonist activity in the sample. In one alternative, the invention provides a pharmaceutical 
composition comprising an antagonist compound identified by the method and a pharmaceutical^ 
35 acceptable excipient. In another alternative, the invention provides a method of treating a disease or 
condition associated with overexpression of functional TPPT, comprising administering to a patient in 
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need of such treatment the pharmaceutical composition. 

The invention further provides a method of screening for a compound that specifically binds 
to a polypeptide comprising an amino acid sequence selected from the group consisting of a) an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally occurring amino 

5 acid sequence having at least 90% sequence identity to an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino acid sequence 
selected from the group consisting of SEQ ID NO: 1-43, and d) an immunogenic fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1-43. The method comprises a) 
combining the polypeptide with at least one test compound under suitable conditions, and b) 

10 detecting binding of the polypeptide to the test compound, thereby identifying a compound that 
specifically binds to the polypeptide. 

The invention further provides a method of screening for a compound that modulates the 
activity of a polypeptide comprising an amino acid sequence selected from the group consisting of a) 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43, b) a naturally 

15 occurring amino acid sequence having at least 90% sequence identity to an amino acid sequence 

selected from the group consisting of SEQ ID NO: 1-43, c) a biologically active fragment of an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1 -43, and d) an immunogenic 
fragment of an amino acid sequence selected from the group consisting of SEQ ID NO: 1-43. The 
method comprises a) combining the polypeptide with at least one test compound under conditions 

20 permissive for the activity of the polypeptide, b) assessing the activity of the polypeptide in the 

presence of the test compound, and c) comparing the activity of the polypeptide in the presence of the 
test compound with the activity of the polypeptide in the absence of the test compound, wherein a 
change in the activity of the polypeptide in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide. 

25 The invention further provides a method for screening a compound for effectiveness in 

altering expression of a target polynucleotide, wherein said target polynucleotide comprises a 
sequence selected from the group consisting of SEQ ID NO:44-86, the method comprising a) 
exposing a sample comprising the target polynucleotide to a compound, and b) detecting altered 
expression of the target polynucleotide. 

30 

BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows polypeptide and nucleotide sequence identification numbers (SEQ ID NOs), 
clone identification numbers (clone IDs), cDNA libraries, and cDNA fragments used to assemble full- 
length sequences encoding TPPT. 
35 Table 2 shows features of each polypeptide sequence, including potential motifs, homologous 

sequences, and methods, algorithms, and searchable databases used for analysis of TPPT. 



14 



WO 00/78953 



PCT/US00/16668 



Table 3 shows selected fragments of each nucleic acid sequence; the tissue-specific 
expression patterns of each nucleic acid sequence as determined by northern analysis; diseases, 
disorders, or conditions associated with these tissues; and the vector into which each cDNA was 
cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which cDNA clones 
encoding TPPT were isolated. 

Table 5 shows the tools, programs, and algorithms used to analyze the polynucleotides and 
polypeptides of the invention, along with applicable descriptions, references, and threshold 
parameters. 

DESCRIPTION OF THE INVENTION 

Before the present proteins, nucleotide sequences, and methods are described, it is understood 
that this invention is not limited to the particular machines, materials and methods described, as these 
may vary. It is also to be understood that the terminology used herein is for the purpose of describing 
particular embodiments only, and is not intended to limit the scope of the present invention which will 
be limited only by the appended claims. 

It must be noted that as used herein and in the appended claims, the singular forms "a," "an," 
and "the" include plural reference unless the context clearly dictates otherwise. Thus, for example, a 
reference to "a host cell" includes a plurality of such host cells, and a reference to "an antibody" is a 
reference to one or more antibodies and equivalents thereof known to those skilled in the an, and so 
forth. 

Unless defined otherwise, all technical and scientific terms used herein have the same 
meanings as commonly understood by one of ordinary skill in the art to which this invention belongs. 
Although any machines, materials, and methods similar or equivalent to those described herein can be 
used to practice or test the present invention, the preferred machines, materials and methods are now 
described. All publications mentioned herein are cited for the purpose of describing and disclosing 
the cell lines, protocols, reagents and vectors which are reported in the publications and which might 
be used in connection with the invention. Nothing herein is to be construed as an admission that the 
invention is not entitled to antedate such disclosure by virtue of prior invention. 
DEFINITIONS 

"TPPT" refers to the amino acid sequences of substantially purified TPPT obtained from any 
species, particularly a mammalian species, including bovine, ovine, porcine, murine, equine, and 
human, and from any source, whether natural, synthetic, semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which intensifies or mimics the biological activity of 
TPPT. Agonists may include proteins, nucleic acids, carbohydrates, small molecules, or any other 
compound or composition which modulates the activity of TPPT either by directly interacting with 
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TPPT or by acting on components of the biological pathway in which TPPT participates. 

An "allelic variant" is an alternative form of the gene encoding TPPT. Allelic variants may 
result from at least one mutation in the nucleic acid sequence and may result in altered mRNAs or in 
polypeptides whose structure or function may or may not be altered. A gene may have none, one, or 
many allelic variants of its naturally occurring form. Common mutational changes which give rise to 
allelic variants are generally ascribed to natural deletions, additions, or substitutions of nucleotides. 
Each of these types of changes may occur alone, or in combination with the others, one or more times 
in a given sequence. 

"Altered" nucleic acid sequences encoding TPPT include those sequences with deletions, 
insertions, or substitutions of different nucleotides, resulting in a polypeptide the same as TPPT or a 
polypeptide with at least one functional characteristic of TPPT. Included within this definition are 
polymorphisms which may or may not be readily detectable using a particular oligonucleotide probe 
of the polynucleotide encoding TPPT, and improper or unexpected hybridization to allelic variants, 
with a locus other than the normal chromosomal locus for the polynucleotide sequence encoding 
TPPT. The encoded protein may also be "altered," and may contain deletions, insertions, or 
substitutions of amino acid residues which produce a silent change and result in a functionally 
equivalent TPPT. Deliberate amino acid substitutions may be made on the basis of similarity in 
polarity, charge, solubility, hydrophobicity, hydrophilicity, and/or the amphipathic nature of the 
residues, as long as the biological or immunological activity of TPPT is retained. For example, 
negatively charged amino acids may include aspartic acid and glutamic acid, and positively charged 
amino acids may include lysine and arginine. Amino acids with uncharged polar side chains having 
similar hydrophilicity values may include: asparagine and glutamine; and serine and threonine. 
Amino acids with uncharged side chains having similar hydrophilicity values may include: leucine, 
isoleucine, and valine; glycine and alanine; and phenylalanine and tyrosine. 

The terms "amino acid" and "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or synthetic 
molecules. Where "amino acid sequence" is recited to refer to a sequence of a naturally occurring 
protein molecule, "amino acid sequence" and like terms are not meant to limit the amino acid 
sequence to the complete native amino acid sequence associated with the recited protein molecule. 

"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term "antagonist" refers to a molecule which inhibits or attenuates the biological activity 
of TPPT. Antagonists may include proteins such as antibodies, nucleic acids, carbohydrates, small 
molecules, or any other compound or composition which modulates the activity of TPPT either by 
directly interacting with TPPT or by acting on components of the biological pathway in which TPPT 
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participates. 

The term "antibody" refers to intact immunoglobulin molecules as well as to fragments 
thereof, such as Fab, F(ab') 2 , and Fv fragments, which are capable of binding an epitopic determinant. 
Antibodies that bind TPPT polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide used 
to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of RNA, 
or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly used 
carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, and 
keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 

The term "antigenic determinant" refers to that region of a molecule (i.e., an epitope) that 
makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (particular regions or three-dimensional structures 
on the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition capable of base-pairing with the "sense" 
(coding) strand of a specific nucleic acid sequence. Antisense compositions may include DNA; RNA; 
peptide nucleic acid (PNA); oligonucleotides having modified backbone linkages such as 
phosphorothioates, methylphosphonates, or benzylphosphonates; oligonucleotides having modified 
sugar groups such as 2'-methoxyethyl sugars or Z-methoxyethoxy sugars; or oligonucleotides having 
modified bases such as 5-methyl cytosine, 2-deoxyuracil, or 7-deaza-2*-deoxyguanosine. Antisense 
molecules may be produced by any method including chemical synthesis or transcription. Once 
introduced into a cell, the complementary antisense molecule base-pairs with a naturally occurring 
nucleic acid sequence produced by the cell to form duplexes which block either transcription or 
translation. The designation "negative" or "minus" can refer to the antisense strand, and the 
designation "positive" or "plus" can refer to the sense strand of a reference DNA molecule. 

The term "biologically active" refers to a protein having structural, regulatory, or biochemical 
functions of a naturally occurring molecule. Likewise, "immunologically active" or "immunogenic" 
refers to the capability of the natural, recombinant, or synthetic TPPT, or of any oligopeptide thereof, 
to induce a specific immune response in appropriate animals or cells and to bind with specific 
antibodies. 

"Complementary" describes the relationship between two single-stranded nucleic acid 
sequences that anneal by base-pairing. For example, S'-AGT-S* pairs with its complement, 
3'-TCA-5\ 

A "composition comprising a given polynucleotide sequence" and a "composition comprising 
a given amino acid sequence" refer broadly to any composition containing the given polynucleotide or 
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amino acid sequence. The composition may comprise a dry formulation or an aqueous solution. 
Compositions comprising polynucleotide sequences encoding TPPT or fragments of TPPT may be 
employed as hybridization probes. The probes may be stored in freeze-dried form and may be 
associated with a stabilizing agent such as a carbohydrate. In hybridizations, the probe may be 

5 deployed in an aqueous solution containing salts (e.g., NaCl), detergents (e.g., sodium dodecyl sulfate; 
SDS), and other components (e.g., Denhardfs solution, dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been subjected to repeated 
DNA sequence analysis to resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, 
Foster City CA) in the 5* and/or the 3' direction, and resequenced, or which has been assembled from 

10 one or more overlapping cDNA, EST, or genomic DNA fragments using a computer program for 
fragment assembly, such as the GEL VIEW fragment assembly system (GCG, Madison WI) or Phrap 
(University of Washington, Seattle WA). Some sequences have been both extended and assembled to 
produce the consensus sequence. 

"Conservative amino acid substitutions" are those substitutions that are predicted to least 

15 interfere with the properties of the original protein, i.e., the structure and especially the function of the 
protein is conserved and not significantly changed by such substitutions. The table below shows 
amino acids which may be substituted for an original amino acid in a protein and which are regarded 
as conservative amino acid substitutions. 





Original Residue 


Conservative Substitution 


20 


Ala 


Gly, Ser 




Arg 


His, Lys 




Asn 


Asp, Gin, His 




Asp 


Asn, Glu 




Cys 


Ala, Ser 


25 


Gin 


Asn, Glu, His 




Glu 


Asp, Gin, His 




Gly 


Ala 




His 


Asn, Arg, Gin, Glu 




He 


Leu, Val 


30 


Leu 


He, Val 




Lys 


Arg, Gin, Glu 




Met 


Leu, He 




Phe 


His, Met, Leu, Trp, Tyr 




Ser 


Cys, Thr 


35 


Thr 


Ser, Val 




Trp 


Phe, Tyr 




Tyr 


His, Phe, Trp 




Val 


He, Leu, Thr 



40 Conservative amino acid substitutions generally maintain (a) the structure of the polypeptide 

backbone in the area of the substitution, for example, as a beta sheet or alpha helical conformation, 
(b) the charge or hydroph obi city of the molecule at the site of the substitution, and/or (c) the bulk of 
the side chain. 
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A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to a chemically modified polynucleotide or polypeptide. 
Chemical modifications of a polynucleotide sequence can include, for example, replacement of 
5 hydrogen by an alkyi, acyl, hydroxy 1, or amino group. A derivative polynucleotide encodes a 

polypeptide which retains at least one biological or immunological function of the natural molecule. 
A derivative polypeptide is one modified by glycosylation, pegylation, or any similar process that 
retains at least one biological or immunological function of the polypeptide from which it was 
derived. 

10 A "detectable label" refers to a reporter molecule or enzyme that is capable of generating a 

measurable signal and is covalently or noncovalently joined to a polynucleotide or polypeptide. 

A "fragment" is a unique portion of TPPT or the polynucleotide encoding TPPT which is 
identical in sequence to but shorter in length than the parent sequence. A fragment may comprise up 
to the entire length of the defined sequence, minus one nucleotide/amino acid residue. For example, a 

15 fragment may comprise from 5 to 1000 contiguous nucleotides or amino acid residues. A fragment 
used as a probe, primer, antigen, therapeutic molecule, or for other purposes, may be at least 5, 10, 15, 
16, 20, 25, 30, 40, 50, 60, 75, 100, 150, 250 or at least 500 contiguous nucleotides or amino acid 
residues in length. Fragments may be preferentially selected from certain regions of a molecule. For 
example, a polypeptide fragment may comprise a certain length of contiguous amino acids selected 

20 from the first 250 or 500 amino acids (or first 25% or 50% of a polypeptide) as shown in a certain 
defined sequence. Clearly these lengths are exemplary, and any length that is supported by the 
specification, including the Sequence Listing, tables, and figures, may be encompassed by the present 
embodiments. 

A fragment of SEQ ID NO:44-86 comprises a region of unique polynucleotide sequence that 
25 specifically identifies SEQ ID NO:44-86, for example, as distinct from any other sequence in the 
genome from which the fragment was obtained. A fragment of SEQ ID NO:44-86 is useful, for 
example, in hybridization and amplification technologies and in analogous methods that distinguish 
SEQ ID NO:44-86 from related polynucleotide sequences. The precise length of a fragment of SEQ 
ID NO:44-86 and the region of SEQ ID NO:44-86 to which the fragment corresponds are routinely 
30 determinable by one of ordinary skill in the art based on the intended purpose for the fragment. 

A fragment of SEQ ID NO: 1-43 is encoded by a fragment of SEQ ID NO:44-86. A fragment 
of SEQ ID NO: 1-43 comprises a region of unique amino acid sequence that specifically identifies 
SEQ ID NO: 1-43. For example, a fragment of SEQ ID NO: 1-43 is useful as an immunogenic peptide 
for the development of antibodies that specifically recognize SEQ ID NO: 1-43. The precise length of 
35 a fragment of SEQ ID NO: 1 -43 and the region of SEQ ID NO: 1 -43 to which the fragment 

corresponds are routinely determinable by one of ordinary skill in the art based on the intended 
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purpose for the fragment. 

A "full-length" polynucleotide sequence is one containing at least a translation initiation 
codon (e.g., methionine) followed by an open reading frame and a translation termination codon. A 
"full-length" polynucleotide sequence encodes a "full-length" polypeptide sequence. 
5 "Homology" refers to sequence similarity or, interchangeably, sequence identity, between two 

or more polynucleotide sequences or two or more polypeptide sequences. 

The terms "percent identity" and "% identity," as applied to polynucleotide sequences, refer 
to the percentage of residue matches between at least two polynucleotide sequences aligned using a 
standardized algorithm. Such an algorithm may insert, in a standardized and reproducible way, gaps 
10 in the sequences being compared in order to optimize alignment between two sequences, and 
therefore achieve a more meaningful comparison of the two sequences. 

Percent identity between polynucleotide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEG ALIGN version 3.12e 
sequence alignment program. This program is part of the LASERGENE software package, a suite of 
15 molecular biological analysis programs (DNASTAR, Madison WI). CLUSTAL V is described in 
Higgins, D.G. and P.M. Sharp (1989) CABIOS 5:151-153 and in Higgins, D.G. et al. (1992) CABIOS 
8:189-191. For pairwise alignments of polynucleotide sequences, the default parameters are set as 
follows: Ktuple=2, gap penalty=5, window=4, and "diagonals saved"=4. The "weighted" residue 
weight table is selected as the default. Percent identity is reported by CLUSTAL V as the "percent 
20 similarity" between aligned polynucleotide sequences. 

Alternatively, a suite of commonly used and freely available sequence comparison algorithms 
is provided by the National Center for Biotechnology Information (NCBI) Basic Local Alignment 
Search Tool (BLAST) (Altschul, S.F. et al. (1990) J. Mol. Biol. 215:403-410), which is available from 
several sources, including the NCBI, Bethesda, MD, and on the Internet at 
25 http://www.ncbi.nlm.nih.gov/BLAST/. The BLAST software suite includes various sequence analysis 
programs including "blastn," that is used to align a known polynucleotide sequence with other 
polynucleotide sequences from a variety of databases. Also available is a tool called "BLAST 2 
Sequences" that is used for direct pairwise comparison of two nucleotide sequences. "BLAST 2 
Sequences" can be accessed and used interactively at http://www.ncbi.nlm.nih.gov/gorf/bl2.htmJ. The 
30 "BLAST 2 Sequences" tool can be used for both blastn and blastp (discussed below). BLAST 

programs are commonly used with gap and other parameters set to default settings. For example, to 
compare two nucleotide sequences, one may use blastn with the "BLAST 2 Sequences" tool Version 
2.0.12 (April-2 1-2000) set at default parameters. Such default parameters may be, for example: 
Matrix: BLOSUM62 
35 Reward for match: I 

Penalty for m ismatch: -2 
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Open Gap: 5 and Extension Gap: 2 penalties 
Gap x drop-off: 50 
Expect: 10 
Word Size: 11 
5 Filter: on 

Percent identity may be measured over the length of an entire defined sequence, for example, 
as defined by a particular SEQ ID number, or may be measured over a shorter length, for example, 
over the length of a fragment taken from a larger, defined sequence, for instance, a fragment of at 
least 20, at least 30, at least 40, at least 50, at least 70, at least 100, or at least 200 contiguous 

10 nucleotides. Such lengths are exemplary only, and it is understood that any fragment length supported 
by the sequences shown herein, in the tables, figures, or Sequence Listing, may be used to describe a 
length over which percentage identity may be measured. 

Nucleic acid sequences that do not show a high degree of identity may nevertheless encode 
similar amino acid sequences due to the degeneracy of the genetic code. It is understood that changes 

15 in a nucleic acid sequence can be made using this degeneracy to produce multiple nucleic acid 
sequences that all encode substantially the same protein. 

The phrases "percent identity** and "% identity," as applied to polypeptide sequences, refer to 
the percentage of residue matches between at least two polypeptide sequences aligned using a 
standardized algorithm. Methods of polypeptide sequence alignment are well-known. Some 

20 alignment methods take into account conservative amino acid substitutions. Such conservative 

substitutions, explained in more detail above, generally preserve the charge and hydrophobicity at the 
site of substitution, thus preserving the structure (and therefore function) of the polypeptide. 

Percent identity between polypeptide sequences may be determined using the default 
parameters of the CLUSTAL V algorithm as incorporated into the MEGALIGN version 3.12e 

25 sequence alignment program (described and referenced above). For pairwise alignments of 

polypeptide sequences using CLUSTAL V, the default parameters are set as follows: Ktuple=l, gap 
penalty=3, window=5, and "diagonals saved"=5. The PAM250 matrix is selected as the default 
residue weight table. As with polynucleotide alignments, the percent identity is reported by 
CLUSTAL V as the "percent similarity" between aligned polypeptide sequence pairs. 

30 Alternatively the NCBI BLAST software suite may be used. For example, for a pairwise 

comparison of two polypeptide sequences, one may use the "BLAST 2 Sequences" tool Version 
2.0.12 (Apr-21-2000) with blastp set at default parameters. Such default parameters may be, for 
example: 

Matrix: BLOSUM62 

35 Open Gap: 11 and Extension Gap: 1 penalties 

Gap x drop-off: 50 
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Expect: 10 
Word Size: 3 
Filter: on 

Percent identity may be measured over the length of an entire defined polypeptide sequence, 
5 for example, as defined by a particular SEQ ID number, or may be measured over a shorter length, for 
example, over the length of a fragment taken from a larger, defined polypeptide sequence, for 
instance, a fragment of at least 15, at least 20, at least 30, at least 40, at least 50, at least 70 or at least 
150 contiguous residues. Such lengths are exemplary only, and it is understood that any fragment 
length supported by the sequences shown herein, in the tables, figures or Sequence Listing, may be 
10 used to describe a length over which percentage identity may be measured. 

"Human artificial chromosomes" (HACs) are linear microchromosomes which may contain 
DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements required for 
chromosome replication, segregation and maintenance. 

The term "humanized antibody" refers to an antibody molecule in which the amino acid 
15 sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 

"Hybridization" refers to the process by which a polynucleotide strand anneals with a 
complementary strand through base pairing under defined hybridization conditions. Specific 
hybridization is an indication that two nucleic acid sequences share a high degree of complementarity. 
20 Specific hybridization complexes form under permissive annealing conditions and remain hybridized 
after the "washing" step(s). The washing step(s) is particularly important in determining the 
stringency of the hybridization process, with more stringent conditions allowing less non-specific 
binding, i.e., binding between pairs of nucleic acid strands that are not perfectly matched. Permissive 
conditions for annealing of nucleic acid sequences are routinely determinable by one of ordinary skill 
25 in the art and may be consistent among hybridization experiments, whereas wash conditions may be 
varied among experiments to achieve the desired stringency, and therefore hybridization specificity. 
Permissive annealing conditions occur, for example, at 68°C in the presence of about 6 x SSC, about 
1% (w/v) SDS, and about 100 (ig/ml sheared, denatured salmon sperm DNA. 

Generally, stringency of hybridization is expressed, in part, with reference to the temperature 
30 under which the wash step is carried out. Such wash temperatures are typically selected to be about 
5°C to 20°C lower than the thermal melting point (T J for the specific sequence at a defined ionic 
strength and pH. The T m is the temperature (under defined ionic strength and pH) at which 50% of 
the target sequence hybridizes to a perfectly matched probe. An equation for calculating T m and 
conditions for nucleic acid hybridization are well known and can be found in Sambrook, J. et al., 
35 1989, Molecular Cloning: A Laboratory Manual 2 nd ed., vol. 1-3, Cold Spring Harbor Press, 
Plainview NY; specifically see volume 2, chapter 9. 
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High stringency conditions for hybridization between polynucleotides of the present invention 
include wash conditions of 68°C in the presence of about 0.2 x SSC and about 0.1% SDS, for 1 hour. 
Alternatively, temperatures of about 65°C, 60°C, 55°C, or 42°C may be used. SSC concentration may 
be varied from about 0.1 to 2 x SSC, with SDS being present at about 0.1%. Typically, blocking 
5 reagents are used to block non-specific hybridization. Such blocking reagents include, for instance, 
sheared and denatured salmon sperm DNA at about 100-200 ug/ml. Organic solvent, such as 
formamide at a concentration of about 35-50% v/v, may also be used under particular circumstances, 
such as for RNA:DNA hybridizations. Useful variations on these wash conditions will be readily 
apparent to those of ordinary skill in the art. Hybridization, particularly under high stringency 
10 conditions, may be suggestive of evolutionary similarity between the nucleotides. Such similarity is 
strongly indicative of a similar role for the nucleotides and their encoded polypeptides. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C 0 t or Rot analysis) or formed between one 
15 nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a solid 
support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate substrate 
to which cells or their nucleic acids have been fixed). 

The words "insertion" and "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively. 
20 "Immune response" can refer to conditions associated with inflammation, trauma, immune 

disorders, or infectious or genetic disease, etc. These conditions can be characterized by expression 
of various factors, e.g., cytokines, chemokines, and other signaling molecules, which may affect 
cellular and systemic defense systems. 

An "immunogenic fragment" is a polypeptide or oligopeptide fragment ofTPPT which is 
25 capable of eliciting an immune response when introduced into a living organism, for example, a 

mammal. The term "immunogenic fragment" also includes any polypeptide or oligopeptide fragment 
ofTPPT which is useful in any of the antibody production methods disclosed herein or known in the 



art. 



The term "microarray" refers to an arrangement of a plurality of polynucleotides, 
30 polypeptides, or other chemical compounds on a substrate. 

The terms "element" and "array element" refer to a polynucleotide, polypeptide, or other 
chemical compound having a unique and defined position on a microarray. 

The term "modulate" refers to a change in the activity of TPPT. For example, modulation 
may cause an increase or a decrease in protein activity, binding characteristics, or any other 
35 biological, functional, or immunological properties of TPPT. 

The phrases "nucleic acid" and "nucleic acid sequence" refer to a nucleotide, oligonucleotide. 



23 



PCT/US00/16668 

WO 00/78953 

polynucleotide, or any fragment thereof. These phrases also refer to DNA or RNA of genomic or 
synthetic origin which may be single-stranded or double-stranded and may represent the sense or the 
antisense strand, to peptide nucleic acid (PNA), or to any DNA-like or RNA-like material. 

"Operably linked" refers to the situation in which a first nucleic acid sequence is placed in a 
5 functional relationship with a second nucleic acid sequence. For instance, a promoter is operably 
linked to a coding sequence if the promoter affects the transcription or expression of the coding 
sequence. Operably linked DNA sequences may be in close proximity or contiguous and. where 
necessary to join two protein coding regions, in the same reading frame. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
10 comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone of 
amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript elongation, 
and may be pegylated to extend their lifespan in the cell. 

"Post-translational modification" of an TPPT may involve lipidation, glycosylation, 
15 phosphorylation, acetylation, racemization, proteolytic cleavage, and other modifications known in 
the art. These processes may occur synthetically or biochemically. Biochemical modifications will 
vary by cell type depending on the enzymatic milieu of TPPT. 

"Probe" refers to nucleic acid sequences encoding TPPT, their complements, or fragments 
thereof, which are used to detect identical, allelic or related nucleic acid sequences. Probes are 
20 isolated oligonucleotides or polynucleotides attached to a detectable label or reporter molecule. 
Typical labels include radioactive isotopes, ligands, chemiluminescent agents, and enzymes. 
"Primers" are short nucleic acids, usually DNA oligonucleotides, which may be annealed to a target 
polynucleotide by complementary base-pairing. The primer may then be extended along the target 
DNA strand by a DNA polymerase enzyme. Primer pairs can be used for amplification (and 
25 identification) of a nucleic acid sequence, e.g., by the polymerase chain reaction (PCR). 

Probes and primers as used in the present invention typically comprise at least 15 contiguous 
nucleotides of a known sequence. In order to enhance specificity, longer probes and primers may also 
be employed, such as probes and primers that comprise at least 20, 25, 30, 40. 50, 60, 70, 80, 90, 100, 
or at least 150 consecutive nucleotides of the disclosed nucleic acid sequences. Probes and primers 
30 may be considerably longer than these examples, and it is understood that any length supported by the 
specification, including the tables, figures, and Sequence Listing, may be used. 

Methods for preparing and using probes and primers are described in the references, for 
example Sambrook, J. et a!., 1989, Millar Cloning- A 1 .ahoratory Manual, 2 nd ed., vol. 1-3, Cold 
Spring Harbor Press, Plainview NY; Ausubel, F.M. et al.,1987, Oirn-nt Protocols in Molecular 
35 Biology. Greene Publ. Assoc. & Wiley-Intersciences, New York NY; Innis, M. et al., 1990, PCR 

Protocols. A GuiHr to Methods and Applications . Academic Press, San Diego CA. PCR primer pairs 
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can be derived from a known sequence, for example, by using computer programs intended for that 
purpose such as Primer (Version 0.5, 1991, Whitehead Institute for Biomedical Research, Cambridge 



5 purpose. For example, OLIGO 4.06 software is useful for the selection of PCR primer pairs of up to 
100 nucleotides each, and for the analysis of oligonucleotides and larger polynucleotides of up to 
5,000 nucleotides from an input polynucleotide sequence of up to 32 kilobases. Similar primer 
selection programs have incorporated additional features for expanded capabilities. For example, the 
PrimOU primer selection program (available to the public from the Genome Center at University of 

10 Texas South West Medical Center, Dallas TX) is capable of choosing specific primers from megabase 
sequences and is thus useful for designing primers on a genome-wide scope. The Primer3 primer 
selection program (available to the public from the Whitehead Institute/MIT Center for Genome 
Research, Cambridge MA) allows the user to input a "misprinting library," in which sequences to 
avoid as primer binding sites are user-specified. Primer3 is useful, in particular, for the selection of 

15 oligonucleotides for microarrays. (The source code for the latter two primer selection programs may 
also be obtained from their respective sources and modified to meet the user's specific needs.) The 
PrimeGen program (available to the public from the UK Human Genome Mapping Project Resource 
Centre, Cambridge UK) designs primers based on multiple sequence alignments, thereby allowing 
selection of primers that hybridize to either the most conserved or least conserved regions of aligned 

20 nucleic acid sequences. Hence, this program is useful for identification of both unique and conserved 
oligonucleotides and polynucleotide fragments. The oligonucleotides and polynucleotide fragments 
identified by any of the above selection methods are useful in hybridization technologies, for example, 
as PCR or sequencing primers, microarray elements, or specific probes to identify fully or partially 
complementary polynucleotides in a sample of nucleic acids. Methods of oligonucleotide selection 

25 are not limited to those described above. 

A "recombinant nucleic acid" is a sequence that is not naturally occurring or has a sequence 
that is made by an artificial combination of two or more otherwise separated segments of sequence. 
This artificial combination is often accomplished by chemical synthesis or, more commonly, by the 
artificial manipulation of isolated segments of nucleic acids, e.g., by genetic engineering techniques 

30 such as those described in Sambrook, supra . The term recombinant includes nucleic acids that have 
been altered solely by addition, substitution, or deletion of a portion of the nucleic acid. Frequently, a 
recombinant nucleic acid may include a nucleic acid sequence operably linked to a promoter 
sequence. Such a recombinant nucleic acid may be part of a vector that is used, for example, to 
transform a cell. 

35 Alternatively, such recombinant nucleic acids may be part of a viral vector, e.g., based on a 

vaccinia virus, that could be use to vaccinate a mammal wherein the recombinant nucleic acid is 



MA). 



Oligonucleotides for use as primers are selected using software known in the art for such 
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expressed, inducing a protective immunological response in the mammal. 

A "regulatory element" refers to a nucleic acid sequence usually derived from untranslated 
regions of a gene and includes enhancers, promoters, introns, and 5' and 3' untranslated regions 
(UTRs). Regulatory elements interact with host or viral proteins which control transcription, 
translation, or RNA stability. 

"Reporter molecules" are chemical or biochemical moieties used for labeling a nucleic acid, 
amino acid, or antibody. Reporter molecules include radionuclides; enzymes; fluorescent, 
chemiluminescent, or chromogenic agents; substrates; cofactors; inhibitors; magnetic particles; and 
other moieties known in the art. 

An "RNA equivalent," in reference to a DNA sequence, is composed of the same linear 
sequence of nucleotides as the reference DNA sequence with the exception that all occurrences of the 
nitrogenous base thymine are replaced with uracil, and the sugar backbone is composed of ribose 
instead of deoxyribose. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding TPPT, or fragments thereof, or TPPT itself, may comprise a bodily fluid; an extract 
from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic DNA, RNA, or 
cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" and "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, an antagonist, a small molecule, or any natural or 
synthetic binding composition. The interaction is dependent upon the presence of a particular 
structure of the protein, e.g., the antigenic determinant or epitope, recognized by the binding 
molecule. For example, if an antibody is specific for epitope "A," the presence of a polypeptide 
comprising the epitope A, or the presence of free unlabeled A, in a reaction containing free labeled A 
and the antibody will reduce the amount of labeled A that binds to the antibody. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least 60% free, 
preferably at least 75% free, and most preferably at least 90% free from other components with which 
they are naturally associated. 

A "substitution" refers to the replacement of one or more amino acid residues or nucleotides 
by different amino acid residues or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 
trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

A "transcript image" refers to the collective pattern of gene expression by a particular cell 
type or tissue under given conditions at a given time. 
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"Transformation" describes a process by which exogenous DNA is introduced into a recipient 
cell. Transformation may occur under natural or artificial conditions according to various methods 
well known in the art, and may rely on any known method for the insertion of foreign nucleic acid 
sequences into a prokaryotic or eukaryotic host cell. The method for transformation is selected based 

5 on the type of host cell being transformed and may include, but is not limited to, bacteriophage or 
viral infection, electroporation, heat shock, lipofection, and particle bombardment. The term 
"transformed" cells includes stably transformed cells in which the inserted DNA is capable of 
replication either as an autonomously replicating plasmid or as part of the host chromosome, as well 
as transiently transformed cells which express the inserted DNA or RNA for limited periods of time. 

10 A "transgenic organism," as used herein, is any organism, including but not limited to animals 

and plants, in which one or more of the cells of the organism contains heterologous nucleic acid 
introduced by way of human intervention, such as by transgenic techniques well known in the art. 
The nucleic acid is introduced into the cell, directly or indirectly by introduction into a precursor of 
the cell, by way of deliberate genetic manipulation, such as by microinjection or by infection with a 

15 recombinant virus. The term genetic manipulation does not include classical cross-breeding, or in 
vitro fertilization, but rather is directed to the introduction of a recombinant DNA molecule. The 
transgenic organisms contemplated in accordance with the present invention include bacteria, 
cyanobacteria, fungi, plants, and animals. The isolated DNA of the present invention can be 
introduced into the host by methods known in the art, for example infection, transfection, 

20 transformation or transconjugation. Techniques for transferring the DNA of the present invention 
into such organisms are widely known and provided in references such as Sambrook et al. (1989), 
supra . 

A "variant" of a particular nucleic acid sequence is defined as a nucleic acid sequence having 
at least 40% sequence identity to the particular nucleic acid sequence over a certain length of one of 

25 the nucleic acid sequences using blastn with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of nucleic acids may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 85%, at least 90%, at least 95% or at least 98% or 
greater sequence identity over a certain defined length. A variant may be described as, for example, 
an "allelic" (as defined above), "splice," "species " or "polymorphic" variant. A splice variant may 

30 have significant identity to a reference molecule, but will generally have a greater or lesser number of 
polynucleotides due to alternative splicing of exons during mRNA processing. The corresponding 
polypeptide may possess additional functional domains or lack domains that are present in the 
reference molecule. Species variants are polynucleotide sequences that vary from one species to 
another. The resulting polypeptides generally will have significant amino acid identity relative to 

35 each other. A polymorphic variant is a variation in the polynucleotide sequence of a particular gene 
between individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
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polymorphisms" (SNPs) in which the polynucleotide sequence varies by one nucleotide base. The 
presence of SNPs may be indicative of, for example, a certain population, a disease state, or a 
propensity for a disease state. 

A "variant" of a particular polypeptide sequence is defined as a polypeptide sequence having 
5 at least 40% sequence identity to the particular polypeptide sequence over a certain length of one of 
the polypeptide sequences using blastp with the "BLAST 2 Sequences" tool Version 2.0.9 (May-07- 
1999) set at default parameters. Such a pair of polypeptides may show, for example, at least 50%, at 
least 60%, at least 70%, at least 80%, at least 90%, at least 95%, or at least 98% or greater sequence 
identity over a certain defined length of one of the polypeptides. 

10 THE INVENTION 

The invention is based on the discovery of new human transport proteins (TPPT), the 
polynucleotides encoding TPPT, and the use of these compositions for the diagnosis, treatment, or 
prevention of transport, metabolic, neurological, cardiovascular, reproductive, and immune disorders, 
and cell proliferative disorders including cancer. 

15 Table 1 lists the Incyte clones used to assemble full length nucleotide sequences encoding 

TPPT. Columns 1 and 2 show the sequence identification numbers (SEQ ID NOs) of the polypeptide 
and nucleotide sequences, respectively. Column 3 shows the clone IDs of the Incyte clones in which 
nucleic acids encoding each TPPT were identified, and column 4 shows the cDNA libraries from 
which these clones were isolated. Column 5 shows Incyte clones and their corresponding cDNA 

20 libraries. Clones for which cDNA libraries are not indicated were derived from pooled cDNA 
libraries. In some cases, GenBank sequence identifiers are also shown in column 5. The Incyte 
clones and GenBank cDNA sequences, where indicated, in column 5 were used to assemble the 
consensus nucleotide sequence of each TPPT and are useful as fragments in hybridization 
technologies. 

25 The columns of Table 2 show various properties of each of the polypeptides of the invention: 

column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3 shows potential phosphorylation sites; column 4 shows potential glycosylation 
sites; column 5 shows the amino acid residues comprising signature sequences and motifs; column 6 
shows homologous sequences as identified by BLAST analysis; and column 7 shows analytical 

30 methods and in some cases, searchable databases to which the analytical methods were applied. The 
methods of column 7 were used to characterize each polypeptide through sequence homology and 
protein motifs. 

The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 
associated with nucleotide sequences encoding TPPT. The first column of Table 3 lists the nucleotide 
35 SEQ ID NOs. Column 2 lists fragments of the nucleotide sequences of column 1 . These fragments 
are useful, for example, in hybridization or amplification technologies to identify SEQ ID NO:44-86 
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and to distinguish between SEQ ID NO:44-86 and related polynucleotide sequences. The 
polypeptides encoded by these fragments are useful, for example, as immunogenic peptides. Column 
3 lists tissue categories which express TPPT as a fraction of total tissues expressing TPPT. Column 4 
lists diseases, disorders, or conditions associated with those tissues expressing TPPT as a fraction of 
5 total tissues expressing TPPT. Column 5 lists the vectors used to subclone each cDNA library. 

Of particular interest is the expression of SEQ ID NO:50 exclusively in cardiovascular tissue, 
the expression of SEQ ID NO:56 in nervous and gastrointestinal tissues, the expression of SEQ ID 
NO:57 in gastrointestinal tissues, and the expression of SEQ ID NO:66 in nervous system tissues. Of 
particular note is the tissue-specific expression of SEQ ID NO:75. Over 71% of the cDNA libraries 

10 expressing SEQ ID NO:75 are derived from lung tissue. 

The columns of Table 4 show descriptions of the tissues used to construct the cDNA libraries 
from which cDNA clones encoding TPPT were isolated. Column 1 references the nucleotide SEQ ID 
NOs, column 2 shows the cDNA libraries from which these clones were isolated, and column 3 shows 
the tissue origins and other descriptive information relevant to the cDNA libraries in column 2. 

15 SEQ ID NO:44 maps to chromosome 7 within the interval from 38.80 to 42.10 centiMorgans. 

SEQ ID NO:48 maps to chromosome X within the interval from 107.90 to 122.80 centiMorgans. 
SEQ ID NO:60 maps to chromosome 2 within the interval from 157.0 to 167.0 centiMorgans. SEQ 
ID NO:65 maps to chromosome 2 within the interval from 17.4 to 40.7 centiMorgans and to 
chromosome 5 within the interval from 61.1 to 69.6 centiMorgans. The interval on chromosome 5 

20 from 61.1 to 69.6 centiMorgans also contains genes associated with Cockayne syndrome. SEQ ID 
NO:69 maps to chromosome 3 within the interval from 157.40 to 162.00 centiMorgans. SEQ ID 
NO:70 maps to chromosome 3 within the interval from 176.40 to 179.80 centiMorgans. SEQ ID 
NO:71 maps to chromosome 18 within the interval from the p-terminus to 52.30 centiMorgans. SEQ 
ID NO:73 maps to chromosome 17 within the interval from 75.70 to 84.20 centiMorgans, and to 

25 chromosome 2 within the interval from 204.70 to 209.30 centiMorgans. SEQ ID NO:76 maps to 
chromosome 20 within the interval from 79.00 to 94.40 centiMorgans. SEQ ID NO:80 maps to 
chromosome 18 within the interval from 1.60 to 6.20 centiMorgans, and to chromosome 1 1 within the 
interval from 1 17.90 to 126.00 centiMorgans. SEQ ID NO:83 maps to chromosome 17 within the 
interval from 67.60 to 69.30 centiMorgans, and from 83.8 centiMorgans to the q-terminus, and to 

30 chromosome 7 within the interval from 105.20 to 1 14.50 centiMorgans. 

The invention also encompasses TPPT variants. A preferred TPPT variant is one which has 
at least about 80%, or alternatively at least about 90%, or even at least about 95% amino acid 
sequence identity to the TPPT amino acid sequence, and which contains at least one functional or 
structural characteristic of TPPT. 

35 The invention also encompasses polynucleotides which encode TPPT. In a particular 

embodiment, the invention encompasses a polynucleotide sequence comprising a sequence selected 
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from the group consisting of SEQ ID NO:44-86, which encodes TPPT. The polynucleotide sequences 
of SEQ ID NO:44-86, as presented in the Sequence Listing, embrace the equivalent RNA sequences, 
wherein occurrences of the nitrogenous base thymine are replaced with uracil, and the sugar backbone 
is composed of ribose instead of deoxyribose. 

5 The invention also encompasses a variant of a polynucleotide sequence encoding TPPT. In 

particular, such a variant polynucleotide sequence will have at least about 70%, or alternatively at 
least about 85%, or even at least about 95% polynucleotide sequence identity to the polynucleotide 
sequence encoding TPPT. A particular aspect of the invention encompasses a variant of a 
polynucleotide sequence comprising a sequence selected from the group consisting of SEQ ID NO:44- 

10 86 which has at least about 70%, or alternatively at least about 85%, or even at least about 95% 
polynucleotide sequence identity to a nucleic acid sequence selected from the group consisting of 
SEQ ID NO:44-86. Any one of the polynucleotide variants described above can encode an amino acid 
sequence which contains at least one functional or structural characteristic of TPPT. 

It will be appreciated by those skilled in the art that as a result of the degeneracy of the 

15 genetic code, a multitude of polynucleotide sequences encoding TPPT, some bearing minimal 
similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 

20 polynucleotide sequence of naturally occurring TPPT, and all such variations are to be considered as 
being specifically disclosed. 

Although nucleotide sequences which encode TPPT and its variants are generally capable of 
hybridizing to the nucleotide sequence of the naturally occurring TPPT under appropriately selected 
conditions of stringency, it may be advantageous to produce nucleotide sequences encoding TPPT or 

25 its derivatives possessing a substantially different codon usage, e.g., inclusion of non-naturally 
occurring codons. Codons may be selected to increase the rate at which expression of the peptide 
occurs in a particular prokaryotic or eukaryotic host in accordance with the frequency with which 
particular codons are utilized by the host. Other reasons for substantially altering the nucleotide 
sequence encoding TPPT and its derivatives without altering the encoded amino acid sequences 

30 include the production of RNA transcripts having more desirable properties, such as a greater 
half-life, than transcripts produced from the naturally occurring sequence. 

The invention also encompasses production of DNA sequences which encode TPPT and 
TPPT derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 
synthetic sequence may be inserted into any of the many available expression vectors and cell systems 

35 using reagents well known in the art. Moreover, synthetic chemistry may be used to introduce 
mutations into a sequence encoding TPPT or any fragment thereof. 
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Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 
NO:44-86 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. and 
S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
5 152:507-5 1 1 .) Hybridization conditions, including annealing and wash conditions, are described in 
"Definitions" 

Methods for DNA sequencing are well known in the art and may be used to practice any of 
the embodiments of the invention. The methods may employ such enzymes as the Klenow fragment 
of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq polymerase (PE 

10 Biosystems, Foster City CA), thermostable T7 polymerase (Amersham Pharmacia Biotech, 

Piscataway NJ), or combinations of polymerases and proofreading exonucleases such as those found 
in the ELONGASE amplification system (Life Technologies, Gaithersburg MD). Preferably, 
sequence preparation is automated with machines such as the MICROLAB 2200 liquid transfer 
system (Hamilton, Reno NV), PTC200 thermal cycler (MJ Research, Watertown MA) and ABI 

15 CATALYST 800 thermal cycler (PE Biosystems). Sequencing is then carried out using either the 
ABI 373 or 377 DNA sequencing system (PE Biosystems), the MEGABACE 1000 DNA sequencing 
system (Molecular Dynamics, Sunnyvale CA), or other systems known in the art. The resulting 
sequences are analyzed using a variety of algorithms which are well known in the art. (See, e.g., 
Ausubel, F.M. (1997) Short Protocols in Molecular Biology . John Wiley & Sons, New York NY, unit 

20 7.7; Meyers, R.A. (1995) Molecular Biology and Biotechnology , Wiley VCH, New York NY, pp. 
856-853.) 

The nucleic acid sequences encoding TPPT may be extended utilizing a partial nucleotide 
sequence and employing various PCR-based methods known in the art to detect upstream sequences, 
such as promoters and regulatory elements. For example, one method which may be employed, 

25 restriction-site PCR, uses universal and nested primers to amplify unknown sequence from genomic 
DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods Applic. 2:318-322.) 
Another method, inverse PCR, uses primers that extend in divergent directions to amplify unknown 
sequence from a circularized template. The template is derived from restriction fragments comprising 
a known genomic locus and surrounding sequences. (See, e.g., Triglia, T. et al. (1988) Nucleic Acids 

30 Res. 16:8 1 86.) A third method, capture PCR, involves PCR amplification of DNA fragments adjacent 
to known sequences in human and yeast artificial chromosome DNA. (See, e.g., Lagerstrom, M. et al. 
(1991) PCR Methods Applic. 1:111-119.) In this method, multiple restriction enzyme digestions and 
ligations may be used to insert an engineered double-stranded sequence into a region of unknown 
sequence before performing PCR. Other methods which may be used to retrieve unknown sequences 

35 are known in the art. (See, e.g., Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). 

Additionally, one may use PCR, nested primers, and PROMOTERFINDER libraries (Clontech, Palo 
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Alto CA) to walk genomic DNA. This procedure avoids the need to screen libraries and is useful in 
finding intron/exon junctions. For all PCR-based methods, primers may be designed using 
commercially available software, such as OLIGO 4.06 Primer Analysis software (National 
Biosciences, Plymouth MN) or another appropriate program, to be about 22 to 30 nucleotides in 
5 length, to have a GC content of about 50% or more, and to anneal to the template at temperatures of 
about 68°C to 72°C 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
sequences containing the 5' regions of genes, are preferable for situations in which an oligo d(T) 

10 library does not yield a full-length cDNA. Genomic libraries may be useful for extension of sequence 
into 5* non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to analyze 
the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, capillary 
sequencing may employ flowable polymers for electrophoretic separation, four different nucleotide- 

15 specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for detection of the 
emitted wavelengths. Output/light intensity may be converted to electrical signal using appropriate 
software (e.g., GENOTYPER and SEQUENCE NAVIGATOR, PE Biosystems), and the entire 
process from loading of samples to computer analysis and electronic data display may be computer 
controlled. Capillary electrophoresis is especially preferable for sequencing small DNA fragments 

20 which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof which 
encode TPPT may be cloned in recombinant DNA molecules that direct expression of TPPT, or 
fragments or functional equivalents thereof, in appropriate host cells. Due to the inherent degeneracy 
of the genetic code, other DNA sequences which encode substantially the same or a functionally 

25 equivalent amino acid sequence may be produced and used to express TPPT. 

The nucleotide sequences of the present invention can be engineered using methods generally 
known in the art in order to alter TPPT-encoding sequences for a variety of purposes including, but 
not limited to, modification of the cloning, processing, and/or expression of the gene product. DNA 
shuffling by random fragmentation and PCR reassembly of gene fragments and synthetic 

30 oligonucleotides may be used to engineer the nucleotide sequences. For example, oligonucleotide- 
mediated site-directed mutagenesis may be used to introduce mutations that create new restriction 
sites, alter glycosylation patterns, change codon preference, produce splice variants, and so forth. 

The nucleotides of the present invention may be subjected to DNA shuffling techniques such 
as MOLECULARBREEDING (Maxygen Inc., Santa Clara CA; described in U.S. Patent Number 

35 5,837,458; Chang, C.-C. et al. (1999) Nat. Biotechnol. 17:793-797; Christians, F.C. et al. (1999) Nat. 
Biotechnol. 17:259-264; and Crameri, A. et al. (1996) Nat. Biotechnol. 14:315-319) to alter or 
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improve the biological properties of TPPT, such as its biological or enzymatic activity or its ability to 
bind to other molecules or compounds. DN A shuffling is a process by which a library of gene 
variants is produced using PCR-mediated recombination of gene fragments. The library is then 
subjected to selection or screening procedures that identify those gene variants with the desired 
5 properties. These preferred variants may then be pooled and further subjected to recursive rounds of 
DNA shuffling and selection/screening. Thus, genetic diversity is created through "artificial" 
breeding and rapid molecular evolution. For example, fragments of a single gene containing random 
point mutations may be recombined, screened, and then reshuffled until the desired properties are 
optimized. Alternatively, fragments of a given gene may be recombined with fragments of 
10 homologous genes in the same gene family, either from the same or different species, thereby 

maximizing the genetic diversity of multiple naturally occurring genes in a directed and controllable 
manner. 

In another embodiment, sequences encoding TPPT may be synthesized, in whole or in part, 
using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) Nucleic Acids 

15 Symp. Ser. 7:215-223; and Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) Alternatively, 
TPPT itself or a fragment thereof may be synthesized using chemical methods. For example, peptide 
synthesis can be performed using various solution-phase or solid-phase techniques. (See, e.g., 
Creighton, T. (1984) Proteins. Str uctures and Molecular Properties . WH Freeman, New York NY, pp. 
55-60; and Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved 

20 using the ABI 431 A peptide synthesizer (PE Biosystems). Additionally, the amino acid sequence of 
TPPT, or any part thereof, may be altered during direct synthesis and/or combined with sequences 
from other proteins, or any part thereof, to produce a variant polypeptide or a polypeptide having a 
sequence of a naturally occurring polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 

25 chromatography. (See, e.g., Chiez, R.M. and F.Z. Regnier (1990) Methods Enzymol. 182:392-421.) 
The composition of the synthetic peptides may be confirmed by amino acid analysis or by sequencing. 
(See, e.g., Creighton, supra , pp. 28-53.) 

In order to express a biologically active TPPT, the nucleotide sequences encoding TPPT or 
derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector which contains 

30 the necessary elements for transcriptional and translational control of the inserted coding sequence in 
a suitable host. These elements include regulatory sequences, such as enhancers, constitutive and 
inducible promoters, and 5' and 3' untranslated regions in the vector and in polynucleotide sequences 
encoding TPPT. Such elements may vary in their strength and specificity. Specific initiation signals 
may also be used to achieve more efficient translation of sequences encoding TPPT. Such signals 

35 include the ATG initiation codon and adjacent sequences, e.g. the Kozak sequence. In cases where 
sequences encoding TPPT and its initiation codon and upstream regulatory sequences are inserted into 
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the appropriate expression vector, no additional transcriptional or transiational control signals may be 
needed. However, in cases where only coding sequence, or a fragment thereof, is inserted, exogenous 
transiational control signals including an in-frame ATG initiation codon should be provided by the 
vector. Exogenous transiational elements and initiation codons may be of various origins, both 
natural and synthetic. The efficiency of expression may be enhanced by the inclusion of enhancers 
appropriate for the particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. 
Cell Differ. 20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct expression 
vectors containing sequences encoding TPPT and appropriate transcriptional and transiational control 
elements. These methods include in vitro recombinant DNA techniques, synthetic techniques, and in 
vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) Molecular Cloning. A Laboratory 
Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 16-17; Ausubel, F.M. et al. (1995) 
Current Protocols in Molecular Biology . John Wiley & Sons, New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express sequences 
encoding TPPT. These include, but are not limited to, microorganisms such as bacteria transformed 
with recombinant bacteriophage, plasmid, or cosmid DNA expression vectors; yeast transformed with 
yeast expression vectors; insect cell systems infected with viral expression vectors (e.g., baculovirus); 
plant cell systems transformed with viral expression vectors (e.g., cauliflower mosaic virus, CaMV, or 
tobacco mosaic virus, TMV) or with bacterial expression vectors (e.g., Ti or pBR322 plasmids); or 
animal cell systems. (See, e.g., Sambrook, supra : Ausubel, supra : Van Heeke, G. and S.M. Schuster 
(1989) J. Biol. Chem. 264:5503-5509; Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; 
Scorer, C.A. et al. (1994) Bio/Technology 12:181-184; Engelhard, E.K. et al. (1994) Proc. Natl. Acad. 
Sci. USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 7:1937-1945; Takamatsu, N. 
(1987) EMBO J. 6:307-31 1; Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; Broglie, R. et al. (1984) 
Science 224:838-843; Winter, J. et al. (1991) Results Probl. Cell Differ. 17:85-105; The McGraw Hill 
Yearbook of Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196; Logan, J. 
and T. Shenk (1984) Proc. Natl. Acad. Sci. USA 81:3655-3659; and Harrington, J.J. et al. (1997) Nat. 
Genet. 15:345-355.) Expression vectors derived from retroviruses, adenoviruses, or herpes or 
vaccinia viruses, or from various bacterial plasmids, may be used for delivery of nucleotide sequences 
to the targeted organ, tissue, or cell population. (See, e.g., Di Nicola, M. et al. (1998) Cancer Gen. 
Ther. 5(6):350-356; Yu, M. et al., (1993) Proc. Natl. Acad. Sci. USA 90(13):6340-6344; Buller, R.M. 
et al. (1985) Nature 317(6040):813-815; McGregor, D.P. et al. (1994) Mol. Immunol. 31 (3):2 19-226; 
and Verma, I.M. and N. Somia (1997) Nature 389:239-242.) The invention is not limited by the host 
cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected depending 
upon the use intended for polynucleotide sequences encoding TPPT. For example, routine cloning, 
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subcloning, and propagation of polynucleotide sequences encoding TPPT can be achieved using a 
multifunctional E. coli vector such as PBLUESCRIPT (Stratagene, La Jolla CA) or PSPORT1 
plasmid (Life Technologies). Ligation of sequences encoding TPPT into the vector* s multiple cloning 
site disrupts the lacZ gene, allowing a colorimetric screening procedure for identification of 

5 transformed bacteria containing recombinant molecules. In addition, these vectors may be useful for 
in vitro transcription, dideoxy sequencing, single strand rescue with helper phage, and creation of 
nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. and S.M. Schuster (1989) J. Biol. 
Chem. 264:5503-5509.) When large quantities of TPPT are needed, e.g. for the production of 
antibodies, vectors which direct high level expression of TPPT may be used. For example, vectors 

10 containing the strong, inducible T5 or T7 bacteriophage promoter may be used. 

Yeast expression systems may be used for production of TPPT. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH 
promoters, may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such 
vectors direct either the secretion or intracellular retention of expressed proteins and enable 

15 integration of foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 
1995, supra : Bitter, supra ; and Scorer, supra .) 

Plant systems may also be used for expression of TPPT. Transcription of sequences encoding 
TPPT may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used alone or in 
combination with the omega leader sequence from TMV (Takamatsu, N. (1987) EMBO J. 6:307-31 1). 

20 Alternatively, plant promoters such as the small subunit of RUBISCO or heat shock promoters may be 
used. (See, e.g., Coruzzi, supra ; Broglie, supra : and Winter, supra .) These constructs can be 
introduced into plant cells by direct DNA transformation or pathogen-mediated transfection. (See, 
e.g., The McGraw Hill Yearbook of Science and Technology (1992) McGraw Hill, New York NY, 
pp. 191-196.) 

25 In mammalian cells, a number of viral-based expression systems may be utilized. In cases 

where an adenovirus is used as an expression vector, sequences encoding TPPT may be ligated into an 
adenovirus transcription/translation complex consisting of the late promoter and tripartite leader 
sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to obtain 
infective virus which expresses TPPT in host cells. (See, e.g., Logan, J. and T. Shenk (1984) Proc. 

30 Natl. Acad. Sci. USA 81 :3655-3659.) In addition, transcription enhancers, such as the Rous sarcoma 
virus (RSV) enhancer, may be used to increase expression in mammalian host cells. SV40 or EBV- 
based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments of 
DNA than can be contained in and expressed from a plasmid. HACs of about 6 kb to 10 Mb are 

35 constructed and delivered via conventional delivery methods (liposomes, polycationic amino 

polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. Genet. 
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15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable expression of 
TPPT in cell lines is preferred. For example, sequences encoding TPPT can be transformed into cell 
lines using expression vectors which may contain viral origins of replication and/or endogenous 
expression elements and a selectable marker gene on the same or on a separate vector. Following the 
introduction of the vector, cells may be allowed to grow for about 1 to 2 days in enriched media 
before being switched to selective media. The purpose of the selectable marker is to confer resistance 
to a selective agent, and its presence allows growth and recovery of cells which successfully express 
the introduced sequences. Resistant clones of stably transformed cells may be propagated using tissue 
culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk' and apf cells, respectively. (See, e.g., Wigler, M. et al. 
(1977) Cell 1 1:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, antibiotic, or 
herbicide resistance can be used as the basis for selection. For example, dhfr confers resistance to 
methotrexate; neo confers resistance to the aminoglycosides neomycin and G-418; and als and pat 
confer resistance to chlorsulfuron and phosphinotricin acetyl transferase, respectively. (See, e.g., 
Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; Colbere-Garapin, F. et al. (1981) 
J. Mol. Biol. 150:1-14.) Additional selectable genes have been described, e.g., trpB and hisD, which 
alter cellular requirements for metabolites. (See, e.g., Hartman, S.C. and R.C. Mulligan (1988) Proc. 
Natl. Acad. Sci. USA 85:8047-8051.) Visible markers, e.g., anthocyanins, green fluorescent proteins 
(GFP; Clontech), 6 glucuronidase and its substrate B-glucuronide, or luciferase and its substrate 
luciferin may be used. These markers can be used not only to identify transformants, but also to 
quantify the amount of transient or stable protein expression attributable to a specific vector system. 
(See, e.g., Rhodes, C.A. (1995) Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of interest is 
also present, the presence and expression of the gene may need to be confirmed. For example, if the 
sequence encoding TPPT is inserted within a marker gene sequence, transformed cells containing 
sequences encoding TPPT can be identified by the absence of marker gene function. Alternatively, a 
marker gene can be placed in tandem with a sequence encoding TPPT under the control of a single 
promoter. Expression of the marker gene in response to induction or selection usually indicates 
expression of the tandem gene as well. 

In general, host cells that contain the nucleic acid sequence encoding TPPT and that express 
TPPT may be identified by a variety of procedures known to those of skill in the art. These 
procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PGR 
amplification, and protein bioassay or immunoassay techniques which include membrane, solution, or 



36 



WO 00/78953 



PCT/US00/16668 



chip based technologies for the detection and/or quantification of nucleic acid or protein sequences. 

Immunological methods for detecting and measuring the expression of TPPT using either 
specific polyclonal or monoclonal antibodies are known in the art. Examples of such techniques 
include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), and 

5 fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering epitopes on TPPT is preferred, but a 
competitive binding assay may be employed. These and other assays are well known in the art. (See, 
e.g., Hampton, R. et al. (1990) Serological Methods, a Laboratory Manual , APS Press, St. Paul MN, 
Sect. IV; Coligan, J.E. et al. (1997) Current Protocols in Immunology , Greene Pub. Associates and 

10 Wiley-Interscience, New York NY; and Pound, J.D. (1998) Immunochemical Protocols, Humana 
Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art and 
may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding TPPT 
15 include oiigolabeling, nick translation, end-labeling, or PCR amplification using a labeled nucleotide. 
Alternatively, the sequences encoding TPPT, or any fragments thereof, may be cloned into a vector 
for the production of an mRNA probe. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by addition of an appropriate RNA polymerase 
such as T7, T3, or SP6 and labeled nucleotides. These procedures may be conducted using a variety 
20 of commercially available kits, such as those provided by Amersham Pharmacia Biotech, Prornega 
(Madison WI), and US Biochemical. Suitable reporter molecules or labels which may be used for 
ease of detection include radionuclides, enzymes, fluorescent, chemi luminescent, or chromogenic 
agents, as well as substrates, cofactors, inhibitors, magnetic particles, and the like. 

Host cells transformed with nucleotide sequences encoding TPPT may be cultured under 
25 conditions suitable for the expression and recovery of the protein from cell culture. The protein 

produced by a transformed cell may be secreted or retained intracellularly depending on the sequence 
and/or the vector used. As will be understood by those of skill in the art, expression vectors 
containing polynucleotides which encode TPPT may be designed to contain signal sequences which 
direct secretion of TPPT through a prokaryotic or eukaryotic cell membrane. 
30 In addition, a host cell strain may be chosen for its ability to modulate expression of the 

inserted sequences or to process the expressed protein in the desired fashion. Such modifications of 
the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a "prepro" or 
"pro" form of the protein may also be used to specify protein targeting, folding, and/or activity. 
35 Different host cells which have specific cellular machinery and characteristic mechanisms for 

post-translational activities (e.g., CHO, HeLa, MDCK, HEK293, and WI38) are available from the 
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American Type Culture Collection (ATCC, Manassas V A) and may be chosen to ensure the correct 
modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding TPPT may be ligated to a heterologous sequence resulting in translation of a 
5 fusion protein in any of the aforementioned host systems. For example, a chimeric TPPT protein 
containing a heterologous moiety that can be recognized by a commercially available antibody may 
facilitate the screening of peptide libraries for inhibitors of TPPT activity. Heterologous protein and 
peptide moieties may also facilitate purification of fusion proteins using commercially available 
affinity matrices. Such moieties include, but are not limited to, glutathione S-transferase (GST), 
10 maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding peptide (CBP), 6-His, FLAG, 
c-mvc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His enable purification of their cognate 
fusion proteins on immobilized glutathione, maltose, phenylarsine oxide, calmodulin, and metal- 
chelate resins, respectively. FLAG, c-myc, and hemagglutinin (HA) enable immunoaffinity 
purification of fusion proteins using commercially available monoclonal and polyclonal antibodies 
15 that specifically recognize these epitope tags. A fusion protein may also be engineered to contain a 
proteolytic cleavage site located between the TPPT encoding sequence and the heterologous protein 
sequence, so that TPPT may be cleaved away from the heterologous moiety following purification. 
Methods for fusion protein expression and purification are discussed in Ausubel (1995, supra, ch. 10). 
A variety of commercially available kits may also be used to facilitate expression and purification of 
20 fusion proteins. 

In a further embodiment of the invention, synthesis of radiolabeled TPPT may be achieved in 
vitro using the TNT rabbit reticulocyte lysate or wheat germ extract system (Promega). These 
systems couple transcription and translation of protein-coding sequences operably associated with the 
T7, T3, or SP6 promoters. Translation takes place in the presence of a radiolabeled amino acid 
25 precursor, for example, 33 S-methionine. 

TPPT of the present invention or fragments thereof may be used to screen for compounds that 
specifically bind to TPPT. At least one and up to a plurality of test compounds may be screened for 
specific binding to TPPT. Examples of test compounds include antibodies, oligonucleotides, proteins 
(e.g., receptors), or small molecules. 
30 In one embodiment, the compound thus identified is closely related to the natural ligand of 

TPPT, e.g., a ligand or fragment thereof, a natural substrate, a structural or functional mimetic, or a 
natural binding partner. (See, Coligan, J.E. et al. (1991) Current Protocols in Immunology 1(2): 
Chapter 5.) Similarly, the compound can be closely related to the natural receptor to which TPPT 
binds, or to at least a fragment of the receptor, e.g., the ligand binding site. In either case, the 
35 compound can be rationally designed using known techniques. In one embodiment, screening for 
these compounds involves producing appropriate cells which express TPPT, either as a secreted 
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protein or on the cell membrane. Preferred cells include cells from mammals, yeast, Drosophila . or E. 
coli . Cells expressing TPPT or cell membrane fractions which contain TPPT are then contacted with 
a test compound and binding, stimulation, or inhibition of activity of either TPPT or the compound is 
analyzed. 

5 An assay may simply test binding of a test compound to the polypeptide, wherein binding is 

detected by a fluorophore, radioisotope, enzyme conjugate, or other detectable label. For example, 
the assay may comprise the steps of combining at least one test compound with TPPT, either in 
solution or affixed to a solid support, and detecting the binding of TPPT to the compound. 
Alternatively, the assay may detect or measure binding of a test compound in the presence of a 
10 labeled competitor. Additionally, the assay may be carried out using cell-free preparations, chemical 
libraries, or natural product mixtures, and the test compound(s) may be free in solution or affixed to a 
solid support. 

TPPT of the present invention or fragments thereof may be used to screen for compounds that 
modulate the activity of TPPT. Such compounds may include agonists, antagonists, or partial or 

15 inverse agonists. In one embodiment, an assay is performed under conditions permissive for TPPT 
activity, wherein TPPT is combined with at least one test compound, and the activity of TPPT in the 
presence of a test compound is compared with the activity of TPPT in the absence of the test 
compound. A change in the activity of TPPT in the presence of the test compound is indicative of a 
compound that modulates the activity of TPPT. Alternatively, a test compound is combined with an 

20 in vitro or cell-free system comprising TPPT under conditions suitable for TPPT activity, and the 
assay is performed. In either of these assays, a test compound which modulates the activity of TPPT 
may do so indirectly and need not come in direct contact with the test compound. At least one and up 
to a plurality of test compounds may be screened. 

In another embodiment, polynucleotides encoding TPPT or their mammalian homologs may 

25 be "knocked out" in an animal model system using homologous recombination in embryonic stem 
(ES) cells. Such techniques are well known in the art and are useful for the generation of animal 
models of human disease. (See, e.g., U.S. Patent No. 5,175,383 and U.S. Patent No. 5,767,337.) For 
example, mouse ES cells, such as the mouse 129/SvJ cell line, are derived from the early mouse 
embryo and grown in culture. The ES cells are transformed with a vector containing the gene of 

30 interest disrupted by a marker gene, e.g., the neomycin phosphotransferase gene (neo; Capecchi, M.R. 
(1989) Science 244:1288-1292). The vector integrates into the corresponding region of the host 
genome by homologous recombination. Alternatively, homologous recombination takes place using 
the Cre-loxP system to knockout a gene of interest in a tissue- or developmental stage-specific 
manner (Marth, J.D. (1996) Clin. Invest. 97:1999-2002; Wagner, K.U. et al. (1997) Nucleic Acids 

35 Res. 25:4323-4330). Transformed ES cells are identified and microinjected into mouse cell 

blastocysts such as those from the C57BL/6 mouse strain. The blastocysts are surgically transferred 
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to pseudopregnant dams, and the resulting chimeric progeny are genotyped and bred to produce 
heterozygous or homozygous strains. Transgenic animals thus generated may be tested with potential 
therapeutic or toxic agents. 

Polynucleotides encoding TPPT may also be manipulated in vitro in ES cells derived from 

5 human blastocysts. Human ES cells have the potential to differentiate into at least eight separate cell 
lineages including endoderm, mesoderm, and ectodermal cell types. These cell lineages differentiate 
into, for example, neural cells, hematopoietic lineages, and cardiomyocytes (Thomson, J.A. et al. 
(1998) Science 282:1 145-1147). 

Polynucleotides encoding TPPT can also be used to create "knockin" humanized animals 

10 (pigs) or transgenic animals (mice or rats) to model human disease. With knockin technology, a 

region of a polynucleotide encoding TPPT is injected into animal ES cells, and the injected sequence 
integrates into the animal cell genome. Transformed cells are injected into blastulae, and the 
blastulae are implanted as described above. Transgenic progeny or inbred lines are studied and 
treated with potential pharmaceutical agents to obtain information on treatment of a human disease. 

15 Alternatively, a mammal inbred to overexpress TPPT, e.g., by secreting TPPT in its milk, may also 
serve as a convenient source of that protein (Janne, J. et al. (1998) BiotechnoL Annu. Rev. 4:55-74). 
THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of TPPT and human transport proteins. In addition, the expression of TPPT is 
20 closely associated with neurological, cardiovascular, reproductive, gastrointestinal, and 

hematopoietic/immune tissues, and inflammation, cell proliferation, and cancer. Therefore, TPPT 
appears to play a role in transport, metabolic, neurological, cardiovascular, reproductive, and immune 
disorders, and cell proliferative disorders including cancer. In the treatment of disorders associated 
with increased TPPT expression or activity, it is desirable to decrease the expression or activity of 
25 TPPT. In the treatment of disorders associated with decreased TPPT expression or activity, it is 
desirable to increase the expression or activity of TPPT. 

Therefore, in one embodiment, TPPT or a fragment or derivative thereof may be administered 
to a subject to treat or prevent a disorder associated with decreased expression or activity of TPPT. 
Examples of such disorders include, but are not limited to, a transport disorder such as akinesia, 
30 amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, Becker's muscular dystrophy, 

Beirs palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes insipidus, diabetic neuropathy, 
Duchenne muscular dystrophy, hyperkalemic periodic paralysis, normokalemic periodic paralysis, 
Parkinson's disease, malignant hyperthermia, multidrug resistance, myasthenia gravis, myotome 
dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral neuropathy, cerebral neoplasms, 
35 prostate cancer; cardiac disorders associated with transport, e.g., angina, bradyarrythmia, 
tachyarrythmia, hypertension, Long QT syndrome, myocarditis, cardiomyopathy, nemaline 
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myopathy, centronuclear myopathy, lipid myopathy, mitochondrial myopathy, thyrotoxic myopathy, 
ethanol myopathy, dermatomyositis, inclusion body myosins, infectious myositis, polymyositis; 
neurological disorders associated with transport, e.g., Alzheimer's disease, amnesia, bipolar disorder, 
dementia, depression, epilepsy, Tourette's disorder, paranoid psychoses, and schizophrenia; and other 
5 disorders associated with transport, e.g., neurofibromatosis, postherpetic neuralgia, trigeminal 

neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, cataracts, infertility, pulmonary artery 
stenosis, sensorineural autosomal deafness, hyperglycemia, hypoglycemia, Grave's disease, goiter, 
Cushing's disease, Addison's disease, glucose- galactose malabsorption syndrome, 
hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes disease, occipital horn 
10 syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and Fanconi disease; a 
metabolic disorder such as Addison's disease, cerebrotendinous xanthomatosis, congenital adrenal 
hyperplasia, coumarin resistance, cystic fibrosis, diabetes, fatty hepatocirrhosis, 
fructose- 1 ,6-diphosphatase deficiency, galactosemia, goiter, glucagonoma, glycogen storage diseases, 
hereditary fructose intolerance, hyperadrenalism, hypoadrenalism, hyperparathyroidism, 
15 hypoparathyroidism, hypercholesterolemia, hyperthyroidism, hypoglycemia, hypothyroidism, 
hyperlipidemia, hyperlipemia, lipid myopathies, lipodystrophies, lysosomal storage diseases, 
mannosidosis, neuraminidase deficiency, obesity, pentosuria phenylketonuria, and pseudovitamin D- 
deficiency rickets; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, 
cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's 
20 disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron 
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple 
sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural 
empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral 
central nervous system disease, prion diseases including kuru, Creutzfeldt-Jakob disease, and 
25 Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases 
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
30 peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 

endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
35 frontotemporal dementia; a cardiovascular disorder such as arteriovenous fistula, atherosclerosis, 
hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, 



41 



WO 00/78953 



PCT/US00/16668 



thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon 
angioplasty, vascular replacement, and coronary artery bypass graft surgery, congestive heart failure, 
ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, 
degenerative valvular heart disease, calcific aortic valve stenosis, congenially bicuspid aortic valve, 
5 mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, 
infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac transplantation, congenital lung 
anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary 
10 hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive pulmonary 
disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, emphysema, chronic 
bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and mycoplasmal pneumonia, 
lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, pneumoconioses, sarcoidosis, 
idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, hypersensitivity pneumonitis, 
15 pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, diffuse pulmonary 

hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary hemosiderosis, pulmonary 
involvement in collagen-vascular disorders, pulmonary alveolar proteinosis, lung tumors, 
inflammatory and noninflammatory pleural effusions, pneumothorax, pleural tumors, drug-induced 
lung disease, radiation-induced lung disease, and complications of lung transplantation; a 
20 reproductive disorder such as a disorder of prolactin production, infertility, including tubal disease, 
ovulatory defects, and endometriosis, a disruption of the estrous cycle, a disruption of the menstrual 
cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, an endometrial or ovarian 
tumor, a uterine fibroid, autoimmune disorders, an ectopic pregnancy, and teratogenesis; cancer of the 
breast, fibrocystic breast disease, and galactorrhea; a disruption of spermatogenesis, abnormal sperm 
25 physiology, cancer of the testis, cancer of the prostate, benign prostatic hyperplasia, prostatitis, 

Peyronie's disease, impotence, carcinoma of the male breast, and gynecomastia; an immune disorder 
such as inflammation, actinic keratosis, acquired immunodeficiency syndrome (AIDS), Addison's 
disease, adult respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, 
arteriosclerosis, asthma, atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, 
30 autoimmune polyendocrinopathy-candidiasis-ectodermal dystrophy (APECED), bronchitis, bursitis, 
cholecystitis, cirrhosis, contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, 
diabetes mellitus, emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, 
erythema nodosum, atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout. Graves' 
disease, Hashimoto's thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, 
35 irritable bowel syndrome, mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia 
gravis, myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, 
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pancreatitis, polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, 
scleroderma, Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic 
sclerosis, primary thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner 
syndrome, complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, 
5 fungal, parasitic, protozoal, and helminthic infections, trauma, and hematopoietic cancer including 
lymphoma, leukemia, and myeloma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 
thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
10 myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 
bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
thymus, thyroid, and uterus. 

In another embodiment, a vector capable of expressing TPPT or a fragment or derivative 
15 thereof may be administered to a subject to treat or prevent a disorder associated with decreased 
expression or activity of TPPT including, but not limited to, those described above. 

In a further embodiment, a pharmaceutical composition comprising a substantially purified 
TPPT in conjunction with a suitable pharmaceutical carrier may be administered to a subject to treat 
or prevent a disorder associated with decreased expression or activity of TPPT including, but not 
20 limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of TPPT may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of TPPT including, but not limited to, those listed above. 

In a further embodiment, an antagonist of TPPT may be administered to a subject to treat or 
25 prevent a disorder associated with increased expression or activity of TPPT. Examples of such 
disorders include, but are not limited to, those transport, metabolic, neurological, cardiovascular, 
reproductive, and immune disorders, and cell proliferative disorders including cancer described 
above. In one aspect, an antibody which specifically binds TPPT may be used directly as an 
antagonist or indirectly as a targeting or delivery mechanism for bringing a pharmaceutical agent to 
30 cells or tissues which express TPPT. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding TPPT may be administered to a subject to treat or prevent a disorder associated with 
increased expression or activity of TPPT including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, complementary 
35 sequences, or vectors of the invention may be administered in combination with other appropriate 
therapeutic agents. Selection of the appropriate agents for use in combination therapy may be made 
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by one of ordinary skill in the art, according to conventional pharmaceutical principles. The 
combination of therapeutic agents may act synergistically to effect the treatment or prevention of the 
various disorders described above. Using this approach, one may be able to achieve therapeutic 
efficacy with lower dosages of each agent, thus reducing the potential for adverse side effects. 
5 An antagonist of TPPT may be produced using methods which are generally known in the art. 

In particular, purified TPPT may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind TPPT. Antibodies to TPPT may also 
be generated using methods that are well known in the art. Such antibodies may include, but are not 
limited to, polyclonal, monoclonal, chimeric, and single chain antibodies, Fab fragments, and 
10 fragments produced by a Fab expression library. Neutralizing antibodies (i.e., those which inhibit 
dimer formation) are generally preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, humans, 
and others may be immunized by injection with TPPT or with any fragment or oligopeptide thereof 
which has immunogenic properties. Depending on the host species, various adjuvants may be used to 
15 increase immunological response. Such adjuvants include, but are not limited to, Freund's, mineral 
gels such as aluminum hydroxide, and surface active substances such as lysolecithin, pluronic polyols, 
polyanions, peptides, oil emulsions, KLH, and dinitrophenol. Among adjuvants used in humans, BCG 
(bacilli Calmette-Guerin) and Corvnebacterium parvum are especially preferable. 

It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to TPPT 
20 have an amino acid sequence consisting of at least about 5 amino acids, and generally will consist of 
at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, or fragments are 
identical to a portion of the amino acid sequence of the natural protein. Short stretches of TPPT 
amino acids may be fused with those of another protein, such as KLH, and antibodies to the chimeric 
molecule may be produced. 
25 Monoclonal antibodies to TPPT may be prepared using any technique which provides for the 

production of antibody molecules by continuous cell lines in culture. These include, but are not 
limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV-hybridoma 
technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et al. (1985) J. 
Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 80:2026-2030; and 
30 Cole, S.P. et al. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies," such as the 
splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) Proc. 
Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 312:604-608; and Takeda, 
35 S. et al. (1985) Nature 314:452-454.) Alternatively, techniques described for the production of single 
chain antibodies may be adapted, using methods known in the art, to produce TPPT-specific single 
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chain antibodies. Antibodies with related specificity, but of distinct idiotypic composition, may be 
generated by chain shuffling from random combinatorial immunoglobulin libraries. (See, e.g., 
Burton, D.R. (1991) Proc. Natl Acad. Sci. USA 88:10134-10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 
5 population or by screening immunoglobulin libraries or panels of highly specific binding reagents as 
disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for TPPT may also be generated. 
For example, such fragments include, but are not limited to, F(ab*) 2 fragments produced by pepsin 

10 digestion of the antibody molecule and Fab fragments generated by reducing the disulfide bridges of 
the F(ab*)2 fragments. Alternatively, Fab expression libraries may be constructed to allow rapid and 
easy identification of monoclonal Fab fragments with the desired specificity. (See, e.g., Huse, W.D. 
et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the desired 

15 specificity. Numerous protocols for competitive binding or immunoradiometric assays using either 
polyclonal or monoclonal antibodies with established specificities are well known in the art. Such 
immunoassays typically involve the measurement of complex formation between TPPT and its 
specific antibody. A two-site, monoclonal-based immunoassay utilizing monoclonal antibodies 
reactive to two non-interfering TPPT epitopes is generally used, but a competitive binding assay may 

20 also be employed (Pound, supra) . 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for TPPT. Affinity is expressed as an 
association constant, which is defined as the molar concentration of TPPT-antibody complex 
divided by the molar concentrations of free antigen and free antibody under equilibrium conditions. 

25 The Ka determined for a preparation of polyclonal antibodies, which are heterogeneous in their 

affinities for multiple TPPT epitopes, represents the average affinity, or avidity, of the antibodies for 
TPPT. The determined for a preparation of monoclonal antibodies, which are monospecific for a 
particular TPPT epitope, represents a true measure of affinity. High-affinity antibody preparations 
with K, ranging from about 10 9 to 10 )2 L/mole are preferred for use in immunoassays in which the 

30 TPPT-antibody complex must withstand rigorous manipulations. Low-affinity antibody preparations 
with Kj ranging from about 10 6 to 10 7 L/mole are preferred for use in immunopuriflcation and similar 
procedures which ultimately require dissociation of TPPT, preferably in active form, from the 
antibody (Catty, D. (1988) Antibodies. Volume I: A Practical Approach . IRL Press, Washington DC; 
Liddell, J.E. and A. Cryer (1991 ) A Practical Guide to Monoclonal Antibodies . John Wiley & Sons, 

35 New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
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determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, preferably 
5-10 mg specific antibody/ml, is generally employed in procedures requiring precipitation of TPPT- 
antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, and guidelines 
5 for antibody quality and usage in various applications, are generally available. (See, e.g., Catty, 
supra , and Coligan et ah, supra .) 

In another embodiment of the invention, the polynucleotides encoding TPPT, or any fragment 
or complement thereof, may be used for therapeutic purposes. In one aspect, modifications of gene 
expression can be achieved by designing complementary sequences or antisense molecules (DNA, 

10 RNA, PNA, or modified oligonucleotides) to the coding or regulatory regions of the gene encoding 
TPPT. Such technology is well known in the art, and antisense oligonucleotides or larger fragments 
can be designed from various locations along the coding or control regions of sequences encoding 
TPPT. (See, e.g., Agrawal, S., ed. ( 1996) Antisense Therapeutics , Humana Press Inc., Totawa NJ.) 
In therapeutic use, any gene delivery system suitable for introduction of the antisense 

15 sequences into appropriate target cells can be used. Antisense sequences can be delivered 

intracellulariy in the form of an expression plasmid which, upon transcription, produces a sequence 
complementary to at least a portion of the cellular sequence encoding the target protein. (See, e.g., 
Slater, J.E. et al. (1998) J. Allergy Clin. Immunol. 102(3):469-475; and Scanlon, K.J. et al. (1995) 
9( 1 3): 1 288- 1 296.) Antisense sequences can also be introduced intracellulariy through the use of viral 

20 vectors, such as retrovirus and adeno-associated virus vectors. (See, e.g., Miller, A.D. (1990) Blood 
76:271; Ausubel, supra : Uckert, W. and W. Walther (1994) Pharmacol. Ther. 63(3):323-347.) Other 
gene delivery mechanisms include liposome-derived systems, artificial viral envelopes, and other 
systems known in the art. (See, e.g., Rossi, JJ. (1995) Br. Med. Bull. 51(l):217-225; Boado, R.J. et 
al. (1998) J. Pharm. Sci. 87(1 1 ): 1308-13 15; and Morris, M.C. et al. (1997) Nucleic Acids Res. 

25 25(14):2730-2736.) 

In another embodiment of the invention, polynucleotides encoding TPPT may be used for 
somatic or germline gene therapy. Gene therapy may be performed to (i) correct a genetic deficiency 
(e.g., in the cases of severe combined immunodeficiency (SCID)-Xl disease characterized by X- 
Iinked inheritance (Cavazzana-Calvo, M. et al. (2000) Science 288:669-672), severe combined 

30 immunodeficiency syndrome associated with an inherited adenosine deaminase (ADA) deficiency 
(Blaese, R.M. et al. (1995) Science 270:475-480; Bordignon, C. et al. (1995) Science 270:470-475), 
cystic fibrosis (Zabner, J. et al. (1993) Cell 75:207-216; Crystal, R.G. et al. (1995) Hum. Gene 
Therapy 6:643-666; Crystal, R.G. et al. (1995) Hum. Gene Therapy 6:667-703), thalassamias, familial 
hypercholesterolemia, and hemophilia resulting from Factor VIII or Factor IX deficiencies (Crystal, 

35 R.G. (1995) Science 270:404-410; Verma, I.M. and Somia, N. (1997) Nature 389:239-242)), (ii) 

express a conditionally lethal gene product (e.g., in the case of cancers which result from unregulated 
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cell proliferation), or (iii) express a protein which affords protection against intracellular parasites 
(e.g., against human retroviruses, such as human immunodeficiency virus (HIV) (Baltimore, D. (1988) 
Nature 335:395-396; Poeschla, E. et al. (1996) Proc. Natl. Acad. Sci. USA. 93:1 1395-1 1399), 
hepatitis B or C virus (HBV, HCV); fungal parasites, such as Candida albicans and Paracoccidioides 
brasiliensis ; and protozoan parasites such as Plasmodium falciparum and Trypanosoma cruzn . In the 
case where a genetic deficiency in TPPT expression or regulation causes disease, the expression of 
TPPT from an appropriate population of transduced cells may alleviate the clinical manifestations 
caused by the genetic deficiency. 

In a further embodiment of the invention, diseases or disorders caused by deficiencies in 
TPPT are treated by constructing mammalian expression vectors encoding TPPT and introducing 
these vectors by mechanical means into TPPT-deficient cells. Mechanical transfer technologies for 
use with cells in vivo or ex vitro include (i) direct DNA microinjection into individual cells, (ii) 
ballistic gold particle delivery, (iii) liposome-mediated transfection, (iv) receptor-mediated gene 
transfer, and (v) the use of DNA transposons (Morgan, R.A. and W.F. Anderson (1993) Annu. Rev. 
Biochem. 62:191-217; Ivies, Z. (1997) Cell 91:501-510; Boulay, J-L. and H. Recipon (1998) Curr. 
Opin. Biotechnol. 9:445-450). 

Expression vectors that may be effective for the expression of TPPT include, but are not 
limited to, the PCDNA 3.1, EPITAG, PRCCMV2, PREP, PVAX vectors (Invitrogen, Carlsbad CA), 
PCMV-SCRTPT, PCMV-TAG, PEGSH/PERV (Stratagene, La Jolla CA), and PTET-OFF, PTET-ON, 
PTRE2, PTRE2-LUC, PTK-HYG (Clontech, Palo Alto CA). TPPT may be expressed using (i) a 
constitutively active promoter, (e.g., from cytomegalovirus (CMV), Rous sarcoma virus (RSV), SV40 
virus, thymidine kinase (TK), or p-actin genes), (ii) an inducible promoter (e.g., the 
tetracycline-regulated promoter (Gossen, M. and H. Bujard (1992) Proc. Natl. Acad. Sci. USA 
89:5547-5551; Gossen, M. et al. (1995) Science 268:1766-1769; Rossi, F.M.V. and H.M. Blau (1998) 
Curr. Opin. Biotechnol. 9:45 1-456), commercially available in the T-REX plasmid (Invitrogen)); the 
ecdysone-inducible promoter (available in the plasmids PVGRXR and PLND; Invitrogen); the 
FK506/rapamycin inducible promoter; or the RU486/mifepristone inducible promoter (Rossi, F.M.V. 
and H.M. Blau, supra)), or (iii) a tissue-specific promoter or the native promoter of the endogenous 
gene encoding TPPT from a normal individual. 

Commercially available liposome transformation kits (e.g., the PERFECT LIPID 
TRANSFECTION KIT, available from Invitrogen) allow one with ordinary skill in the art to deliver 
polynucleotides to target cells in culture and require minimal effort to optimize experimental 
parameters. In the alternative, transformation is performed using the calcium phosphate method 
(Graham, F.L. and A.J. Eb (1973) Virology 52:456-467), or by electroporation (Neumann, E. et al. 
(1982) EMBO J. 1:841-845). The introduction of DNA to primary cells requires modification of 
these standardized mammalian transfection protocols. 
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In another embodiment of the invention, diseases or disorders caused by genetic defects with 
respect to TPPT expression are treated by constructing a retrovirus vector consisting of (i) the 
polynucleotide encoding TPPT under the control of an independent promoter or the retrovirus long 
terminal repeat (LTR) promoter, (ii) appropriate RNA packaging signals, and (iii) a Rev-responsive 

5 element (RRE) along with additional retrovirus c/s-acting RNA sequences and coding sequences 
required for efficient vector propagation. Retrovirus vectors (e.g., PFB and PFBNEO) are 
commercially available (Stratagene) and are based on published data (Riviere, I. et al. (1995) Proc. 
Natl. Acad. Sci. USA 92:6733-6737), incorporated by reference herein. The vector is propagated in 
an appropriate vector producing cell line (VPCL) that expresses an envelope gene with a tropism for 

10 receptors on the target cells or a promiscuous envelope protein such as VSVg (Armentano, D. et al. 
(1987) J. Virol. 61:1647-1650; Bender, M.A. et al. (1987) J. Virol. 61 :1639-1646; Adam, M.A. and 
A.D. Miller (1988) J. Virol. 62:3802-3806; Dull, T. et al. (1998) J. Virol. 72:8463-8471; Zufferey, R. 
et al. (1998) J. Virol. 72:9873-9880). U.S. Patent Number 5,910,434 to Rigg ("Method for obtaining 
retrovirus packaging cell lines producing high transducing efficiency retroviral supernatant") 

15 discloses a method for obtaining retrovirus packaging cell lines and is hereby incorporated by 

reference. Propagation of retrovirus vectors, transduction of a population of cells (e.g., CD4* T-cells), 
and the return of transduced cells to a patient are procedures well known to persons skilled in the art 
of gene therapy and have been well documented (Ranga, U. et al. (1997) J. Virol. 71:7020-7029; 
Bauer, G. et al. (1997) Blood 89:2259-2267; Bonyhadi, M.L. (1997) J. Virol. 71:4707-4716; Ranga, 

20 U. et al. (1998) Proc. Natl. Acad. Sci. USA 95: 1201-1206; Su, L. (1997) Blood 89:2283-2290). 

In the alternative, an adenovirus-based gene therapy delivery system is used to deliver 
polynucleotides encoding TPPT to cells which have one or more genetic abnormalities with respect to 
the expression of TPPT. The construction and packaging of adenovirus-based vectors are well known 
to those with ordinary skill in the art. Replication defective adenovirus vectors have proven to be 

25 versatile for importing genes encoding immunoregulatory proteins into intact islets in the pancreas 
(Csete, M.E. et al. (1995) Transplantation 27:263-268). Potentially useful adenoviral vectors are 
described in U.S. Patent Number 5,707,618 to Armentano ("Adenovirus vectors for gene therapy"), 
hereby incorporated by reference. For adenoviral vectors, see also Antinozzi, P. A. et al. (1999) Annu. 
Rev. Nutr. 19:51 1-544; and Verma, I.M. and N. Somia (1997) Nature 18:389:239-242, both 

30 incorporated by reference herein. 

In another alternative, a herpes-based, gene therapy delivery system is used to deliver 
polynucleotides encoding TPPT to target cells which have one or more genetic abnormalities with 
respect to the expression of TPPT. The use of herpes simplex virus (HSV)-based vectors may be 
especially valuable for introducing TPPT to cells of the central nervous system, for which HSV has a 

35 tropism. The construction and packaging of herpes-based vectors are well known to those with 

ordinary skill in the art. A replication-competent herpes simplex virus (HSV) type 1 -based vector has 
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been used to deliver a reporter gene to the eyes of primates (Liu, X. et al. (1999) Exp. Eye 
Res. 169:385-395). The construction of a HSV-1 virus vector has also been disclosed in detail in U.S. 
Patent Number 5,804,413 to DeLuca ("Herpes simplex virus strains for gene transfer"), which is 
hereby incorporated by reference. U.S. Patent Number 5,804,413 teaches the use of recombinant 
5 HSV d92 which consists of a genome containing at least one exogenous gene to be transferred to a 
cell under the control of the appropriate promoter for purposes including human gene therapy. Also 
taught by this patent are the construction and use of recombinant HSV strains deleted for ICP4, ICP27 
and ICP22. For HSV vectors, see also Goins, W.F. et al. (1999) J. Virol. 73:519-532 and Xu, H. et al. 
(1994) Dev. Biol. 163:152-161, hereby incorporated by reference. The manipulation of cloned 
10 herpesvirus sequences, the generation of recombinant virus following the transfection of multiple 
plasmids containing different segments of the large herpesvirus genomes, the growth and propagation 
of herpesvirus, and the infection of cells with herpesvirus are techniques well known to those of 
ordinary skill in the art. 

In another alternative, an alphavirus (positive, single-stranded RNA virus) vector is used to 
15 deliver polynucleotides encoding TPPT to target cells. The biology of the prototypic alphavirus, 
Semliki Forest Virus (SFV), has been studied extensively and gene transfer vectors have been based 
on the SFV genome (Garoff, H. and K.-J. Li (1998) Curr. Opin. Biotech. 9:464-469). During 
alphavirus RNA replication, a subgenomic RNA is generated that normally encodes the viral capsid 
proteins. This subgenomic RNA replicates to higher levels than the full-length genomic RNA, 
20 resulting in the overproduction of capsid proteins relative to the viral proteins with enzymatic activity 
(e.g., protease and polymerase). Similarly, inserting the coding sequence for TPPT into the alphavirus 
genome in place of the capsid-coding region results in the production of a large number of TPPT- 
coding RNAs and the synthesis of high levels of TPPT in vector transduced cells. While alphavirus 
infection is typically associated with cell lysis within a few days, the ability to establish a persistent 
25 infection in hamster normal kidney cells (BHK-21) with a variant of Sindbis virus (SIN) indicates that 
the lytic replication of alphaviruses can be altered to suit the needs of the gene therapy application 
(Dryga, S.A. et al. (1997) Virology 228:74-83). The wide host range of alphaviruses will allow the 
introduction of TPPT into a variety of cell types. The specific transduction of a subset of cells in a 
population may require the sorting of cells prior to transduction. The methods of manipulating 
30 infectious cDNA clones of alphaviruses, performing alphavirus cDNA and RNA transfections, and 
performing alphavirus infections, are well known to those with ordinary skill in the art. 

Oligonucleotides derived from the transcription initiation site, e.g., between about positions 
-10 and +10 from the start site, may also be employed to inhibit gene expression. Similarly, inhibition 
can be achieved using triple helix base-pairing methodology. Triple helix pairing is useful because it 
35 causes inhibition of the ability of the double helix to open sufficiently for the binding of polymerases, 
transcription factors, or regulatory molecules. Recent therapeutic advances using triplex DNA have 
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been described in the literature. (See, e.g., Gee, J.E. et al. (1994) in Huber, B.E. and B.I. Carr, 
Molecular and Immunologic Approaches . Futura Publishing, Mt. Kisco NY, pp. 163-177.) A 
complementary sequence or antisense molecule may also be designed to block translation of mRNA 
by preventing the transcript from binding to ribosomes. 

Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage of 
RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the ribozyme 
molecule to complementary target RNA, followed by endonucleolytic cleavage. For example, 
engineered hammerhead motif ribozyme molecules may specifically and efficiently catalyze 
endonucleolytic cleavage of sequences encoding TPPT. 

Specific ribozyme cleavage sites within any potential RNA target are initially identified by 
scanning the target molecule for ribozyme cleavage sites, including the following sequences: GUA, 
GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 ribonucleotides, 
corresponding to the region of the target gene containing the cleavage site, may be evaluated for 
secondary structural features which may render the oligonucleotide inoperable. The suitability of 
candidate targets may also be evaluated by testing accessibility to hybridization with complementary 
oligonucleotides using ribonuc lease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be prepared 
by any method known in the art for the synthesis of nucleic acid molecules. These include techniques 
for chemically synthesizing oligonucleotides such as solid phase phosphoramidite chemical synthesis. 
Alternatively, RNA molecules may be generated by in vitro and in vivo transcription of DNA 
sequences encoding TPPT. Such DNA sequences may be incorporated into a wide variety of vectors 
with suitable RNA polymerase promoters such as T7 or SP6. Alternatively, these cDNA constructs 
that synthesize complementary RNA, constitutively or inducibly, can be introduced into cell lines, 
cells, or tissues. 

RNA molecules may be modified to increase intracellular stability and half-life. Possible 
modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3' ends 
of the molecule, or the use of phosphorothioate or T O-methyl rather than phosphodiesterase linkages 
within the backbone of the molecule. This concept is inherent in the production of PNAs and can be 
extended in all of these molecules by the inclusion of nontraditional bases such as inosine, queosine, 
and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms of adenine, cytidine, 
guanine, thymine, and uridine which are not as easily recognized by endogenous endonucleases. 

An additional embodiment of the invention encompasses a method for screening for a 
compound which is effective in altering expression of a polynucleotide encoding TPPT. Compounds 
which may be effective in altering expression of a specific polynucleotide may include, but are not 
limited to, oligonucleotides, antisense oligonucleotides, triple helix-forming oligonucleotides, 
transcription factors and other polypeptide transcriptional regulators, and non-macromolecular 
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chemical entities which are capable of interacting with specific polynucleotide sequences. Effective 
compounds may alter polynucleotide expression by acting as either inhibitors or promoters of 
polynucleotide expression. Thus, in the treatment of disorders associated with increased TPPT 
expression or activity, a compound which specifically inhibits expression of the polynucleotide 

5 encoding TPPT may be therapeutically useful, and in the treament of disorders associated with 
decreased TPPT expression or activity, a compound which specifically promotes expression of the 
polynucleotide encoding TPPT may be therapeutically useful. 

At least one, and up to a plurality, of test compounds may be screened for effectiveness in 
altering expression of a specific polynucleotide. A test compound may be obtained by any method 

10 commonly known in the art, including chemical modification of a compound known to be effective in 
altering polynucleotide expression; selection from an existing, commercially-available or proprietary 
library of naturally-occurring or non-natural chemical compounds; rational design of a compound 
based on chemical and/or structural properties of the target polynucleotide; and selection from a 
library of chemical compounds created combinatorially or randomly. A sample comprising a 

15 polynucleotide encoding TPPT is exposed to at least one test compound thus obtained. The sample 
may comprise, for example, an intact or permeabilized cell, or an in vitro cell-free or reconstituted 
biochemical system. Alterations in the expression of a polynucleotide encoding TPPT are assayed by 
any method commonly known in the art. Typically, the expression of a specific nucleotide is detected 
by hybridization with a probe having a nucleotide sequence complementary to the sequence of the 

20 polynucleotide encoding TPPT. The amount of hybridization may be quantified, thus forming the 
basis for a comparison of the expression of the polynucleotide both with and without exposure to one 
or more test compounds. Detection of a change in the expression of a polynucleotide exposed to a 
test compound indicates that the test compound is effective in altering the expression of the 
polynucleotide. A screen for a compound effective in altering expression of a specific polynucleotide 

25 can be carried out, for example, using a Schizosaccharomvces pombe gene expression system 
(Atkins, D. et al. (1999) U.S. Patent No. 5.932,435; Arndt, G.M. et al. (2000) Nucleic Acids Res. 
28:E15) or a human cell line such as HeLa cell (Clarke, M.L. et al. (2000) Biochem. Biophys. Res. 
Commun. 268:8-13). A particular embodiment of the present invention involves screening a 
combinatorial library of oligonucleotides (such as deoxyribonucleotides, ribonucleotides, peptide 

30 nucleic acids, and modified oligonucleotides) for antisense activity against a specific polynucleotide 
sequence (Bruice, T.W. et al. (1997) U.S. Patent No. 5,686,242; Bruice, T.W. et al. (2000) U.S. Patent 
No. 6,022,691). 

Many methods for introducing vectors into cells or tissues are available and equally suitable 
for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into stem cells 
35 taken from the patient and clonally propagated for autologous transplant back into that same patient. 
Delivery by transfection, by liposome injections, or by polycationic amino polymers may be achieved 
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using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. (1997) Nat. 
Biotechnol. 15:462-466.) 

Any of the therapeutic methods described above may be applied to any subject in need of 
such therapy, including, for example, mammals such as humans, dogs, cats, cows, horses, rabbits, and 
5 monkeys. 

An additional embodiment of the invention relates to the administration of a pharmaceutical 
composition which generally comprises an active ingredient formulated with a pharmaceutically 
acceptable excipient. Excipients may include, for example, sugars, starches, celluloses, gums, and 
proteins. Various formulations are commonly known and are thoroughly discussed in the latest 

10 edition of Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). Such 

pharmaceutical compositions may consist of TPPT, antibodies to TPPT, and mimetics, agonists, 
antagonists, or inhibitors of TPPT. 

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 

15 intramedullary, intrathecal, intraventricular, pulmonary, transdermal, subcutaneous, intraperitoneal, 
intranasal, enteral, topical, sublingual* or rectal means. 

Pharmaceutical compositions for pulmonary administration may be prepared in liquid or dry 
powder form. These compositions are generally aerosolized immediately prior to inhalation by the 
patient. In the case of small molecules (e.g. traditional low molecular weight organic drugs), aerosol 

20 delivery of fast-acting formulations is well-known in the art. In the case of macromolecules (e.g. 

larger peptides and proteins), recent developments in the field of pulmonary delivery via the alveolar 
region of the lung have enabled the practical delivery of drugs such as insulin to blood circulation 
(see, e.g., Patton, J.S. et al., U.S. Patent No. 5,997,848). Pulmonary delivery has the advantage of 
administration without needle injection, and obviates the need for potentially toxic penetration 

25 enhancers. 

Pharmaceutical compositions suitable for use in the invention include compositions wherein 
the active ingredients are contained in an effective amount to achieve the intended purpose. The 
determination of an effective dose is well within the capability of those skilled in the art. 

Specialized forms of pharmaceutical compositions may be prepared for direct intracellular 
30 delivery of macromolecules comprising TPPT or fragments thereof. For example, liposome 

preparations containing a cell-impermeable macromolecule may promote cell fusion and intracellular 
delivery of the macromolecule. Alternatively, TPPT or a fragment thereof may be joined to a short 
cationic N-terminal portion from the HIV Tat-1 protein. Fusion proteins thus generated have been 
found to transduce into the cells of all tissues, including the brain, in a mouse model system 
35 (Schwarze, S.R. et al. (1999) Science 285:1569-1572). 

For any compound, the therapeutically effective dose can be estimated initially either in cell 
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culture assays, e.g.. of neoplastic cells, or in animal models such as mice, rats, rabbits, dogs, monkeys, 
or pigs An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes for 

administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient, for example TPPT 
or fragments thereof, antibodies of TPPT, and agonists, antagonists or inhibitors of TPPT, which 
ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be determined by 
standard pharmaceutical procedures in cell cultures or with experimental animals, such as by 
calculating the ED S0 (the dose therapeutically effective in 50% of the population) or LD* (the dose 
lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic effects is the 
therapeutic index, which can be expressed as the LIVED, ratio. Pharmaceutical composite 
which exhibit large therapeutic indices are preferred. The data obtained from cell culture assays and 
animal studies are used to formulate a range of dosage for human use. The dosage contamed u> such 
compositions is preferably within a range of circulating concentrations that includes the ED 50 w.th 
,5 little or no toxicity. The dosage varies within this range depending upon the dosage form employed, 
the sensitivity of the patient, and the route of administration. 

The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of the 
active moiety or to maintain the desired effect. Factors which may be taken into account include the 
20 severity of the disease state, the general health of the subject, the age, weight, and gender of the 
subject, time and frequency of administration, drug combination(s). reaction sensitivit.es, and 
response to therapy. Long-acting pharmaceutical compositions may be administered every 3 to 4 
days, every week, or biweekly depending on the half-life and clearance rate of the particular 
formulation. 

25 Normal dosage amounts may vary from about 0.1 jig to 100,000 /.g, up to a total dose of 

about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or then- 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular cells, 

30 conditions, locations, etc. 
DIAGNOSTICS 

In another embodiment, antibodies which specifically bind TPPT may be used for the 
diagnosis of disorders characterized by expression of TPPT. or in assays to monitor patients being 
treated with TPPT or agonists, antagonists, or inhibitors of TPPT. Antibodies useful for diagnost.c 
35 purposes may be prepared in the same manner as described above for therapeutics. Diagnost.c assays 
for TPPT include methods which utilize the antibody and a label to detect TPPT in human body flu.ds 
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or in extracts of cells or tissues. The antibodies may be used with or without modification, and may 
be labeled by covalent or non-covalent attachment of a reporter molecule. A wide variety of reporter 
molecules, several of which are described above, are known in the art and may be used. 

A variety of protocols for measuring TPPT, including ELIS As, RIAs, and FACS, are known 

5 in the art and provide a basis for diagnosing altered or abnormal levels of TPPT expression. Normal 
or standard values for TPPT expression are established by combining body fluids or cell extracts 
taken from normal mammalian subjects, for example, human subjects, with antibody to TPPT under 
conditions suitable for complex formation. The amount of standard complex formation may be 
quantitated by various methods, such as photometric means. Quantities of TPPT expressed in subject, 

10 control, and disease samples from biopsied tissues are compared with the standard values. Deviation 
between standard and subject values establishes the parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding TPPT may be used for 
diagnostic purposes. The polynucleotides which may be used include oligonucleotide sequences, 
complementary RNA and DNA molecules, and PNAs. The polynucleotides may be used to detect and 

15 quantify gene expression in biopsied tissues in which expression of TPPT may be correlated with 
disease. The diagnostic assay may be used to determine absence, presence, and excess expression of 
TPPT, and to monitor regulation of TPPT levels during therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting polynucleotide 
sequences, including genomic sequences, encoding TPPT or closely related molecules may be used to 

20 identify nucleic acid sequences which encode TPPT. The specificity of the probe, whether it is made 
from a highly specific region, e.g., the 5' regulatory region, or from a less specific region, e.g., a 
conserved motif, and the stringency of the hybridization or amplification will determine whether the 
probe identifies only naturally occurring sequences encoding TPPT, allelic variants, or related 
sequences. 

25 Probes may also be used for the detection of related sequences, and may have at least 50% 

sequence identity to any of the TPPT encoding sequences. The hybridization probes of the subject 
invention may be DNA or RNA and may be derived from the sequence of SEQ ID NO:44-86 or from 
genomic sequences including promoters, enhancers, and introns of the TPPT gene. 

Means for producing specific hybridization probes for DNAs encoding TPPT include the 

30 cloning of polynucleotide sequences encoding TPPT or TPPT derivatives into vectors for the 

production of mRNA probes. Such vectors are known in the art, are commercially available, and may 
be used to synthesize RNA probes in vitro by means of the addition of the appropriate RNA 
polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled by a 
variety of reporter groups, for example, by radionuclides such as 32 P or 35 S, or by enzymatic labels, 

35 such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and the like. 
Polynucleotide sequences encoding TPPT may be used for the diagnosis of disorders 
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associated with expression of TPPT. Examples of such disorders include, but are not limited to, a 
transport disorder such as akinesia, amyotrophic lateral sclerosis, ataxia telangiectasia, cystic fibrosis, 
Becker's muscular dystrophy, Bell's palsy, Charcot-Marie Tooth disease, diabetes mellitus, diabetes 
insipidus, diabetic neuropathy, Duchenne muscular dystrophy, hyperkalemic periodic paralysis. 
5 normokalemic periodic paralysis, Parkinson's disease, malignant hyperthermia, multidrug resistance, 
myasthenia gravis, myotonic dystrophy, catatonia, tardive dyskinesia, dystonias, peripheral 
neuropathy, cerebral neoplasms, prostate cancer; cardiac disorders associated with transport, e.g., 
angina, bradyarrythmia, tachyarrythmia, hypertension, Long QT syndrome, myocarditis, 
cardiomyopathy, nemaline myopathy, centronuclear myopathy, lipid myopathy, mitochondrial 
10 myopathy, thyrotoxic myopathy, ethanol myopathy, dermatomyositis, inclusion body myositis, 

infectious myositis, polymyositis; neurological disorders associated with transport, e.g., Alzheimer's 
disease, amnesia, bipolar disorder, dementia, depression, epilepsy, Tourette's disorder, paranoid 
psychoses, and schizophrenia; and other disorders associated with transport, e.g., neurofibromatosis, 
postherpetic neuralgia, trigeminal neuropathy, sarcoidosis, sickle cell anemia, Wilson's disease, 
15 cataracts, infertility, pulmonary artery stenosis, sensorineural autosomal deafness, hyperglycemia, 
hypoglycemia, Grave's disease, goiter, Cushing's disease, Addison's disease, glucose-galactose 
malabsorption syndrome, hypercholesterolemia, adrenoleukodystrophy, Zellweger syndrome, Menkes 
disease, occipital horn syndrome, von Gierke disease, cystinuria, iminoglycinuria, Hartup disease, and 
Fanconi disease; a metabolic disorder such as Addison's disease, cerebrotendinous xanthomatosis, 
20 congenital adrenal hyperplasia, coumarin resistance, cystic fibrosis, diabetes, fatty hepatocirrhosis, 
fructose- 1,6-diphosphatase deficiency, galactosemia, goiter, glucagonoma, glycogen storage diseases, 
hereditary fructose intolerance, hyperadrenalism, hypoadrenalism, hyperparathyroidism, 
hypoparathyroidism, hypercholesterolemia, hyperthyroidism, hypoglycemia, hypothyroidism, 
hyperlipidemia, hyperlipemia, lipid myopathies, lipodystrophies, lysosomal storage diseases, 
25 mannosidosis, neuraminidase deficiency, obesity, pentosuria phenylketonuria, and pseudovitamin D- 
deficiency rickets; a neurological disorder such as epilepsy, ischemic cerebrovascular disease, stroke, 
cerebral neoplasms, Alzheimer's disease, Pick's disease, Huntington's disease, dementia, Parkinson's 
disease and other extrapyramidal disorders, amyotrophic lateral sclerosis and other motor neuron 
disorders, progressive neural muscular atrophy, retinitis pigmentosa, hereditary ataxias, multiple 
30 sclerosis and other demyelinating diseases, bacterial and viral meningitis, brain abscess, subdural 
empyema, epidural abscess, suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral 
central nervous system disease, prion diseases including kuru, Creutzfeldt- Jakob disease, and 
Gerstmann-Straussler-Scheinker syndrome, fatal familial insomnia, nutritional and metabolic diseases 
of the nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
35 encephalotrigeminal syndrome, mental retardation and other developmental disorders of the central 
nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system disorders, cranial 
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nerve disorders, spinal cord diseases, muscular dystrophy and other neuromuscular disorders, 
peripheral nervous system disorders, dermatomyositis and polymyositis, inherited, metabolic, 
endocrine, and toxic myopathies, myasthenia gravis, periodic paralysis, mental disorders including 
mood, anxiety, and schizophrenic disorders, seasonal affective disorder (SAD), akathesia, amnesia, 
5 catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 

neuralgia, Tourette's disorder, progressive supranuclear palsy, corticobasal degeneration, and familial 
frontotemporal dementia; a cardiovascular disorder such as arteriovenous fistula, atherosclerosis, 
hypertension, vasculitis, Raynaud's disease, aneurysms, arterial dissections, varicose veins, 
thrombophlebitis and phlebothrombosis, vascular tumors, and complications of thrombolysis, balloon 
10 angioplasty, vascular replacement, and coronary artery bypass graft surgery, congestive heart failure, 
ischemic heart disease, angina pectoris, myocardial infarction, hypertensive heart disease, 
degenerative valvular heart disease, calcific aortic valve stenosis, congenially bicuspid aortic valve, 
mitral annular calcification, mitral valve prolapse, rheumatic fever and rheumatic heart disease, 
infective endocarditis, nonbacterial thrombotic endocarditis, endocarditis of systemic lupus 
15 erythematosus, carcinoid heart disease, cardiomyopathy, myocarditis, pericarditis, neoplastic heart 
disease, congenital heart disease, and complications of cardiac transplantation, congenital lung 
anomalies, atelectasis, pulmonary congestion and edema, pulmonary embolism, pulmonary 
hemorrhage, pulmonary infarction, pulmonary hypertension, vascular sclerosis, obstructive 
pulmonary disease, restrictive pulmonary disease, chronic obstructive pulmonary disease, 
20 emphysema, chronic bronchitis, bronchial asthma, bronchiectasis, bacterial pneumonia, viral and 
mycoplasmal pneumonia, lung abscess, pulmonary tuberculosis, diffuse interstitial diseases, 
pneumoconioses, sarcoidosis, idiopathic pulmonary fibrosis, desquamative interstitial pneumonitis, 
hypersensitivity pneumonitis, pulmonary eosinophilia bronchiolitis obliterans-organizing pneumonia, 
diffuse pulmonary hemorrhage syndromes, Goodpasture's syndromes, idiopathic pulmonary 
25 hemosiderosis, pulmonary involvement in collagen-vascular disorders, pulmonary alveolar 
proteinosis, lung tumors, inflammatory and noninflammatory pleural effusions, pneumothorax, 
pleural tumors, drug-induced lung disease, radiation-induced lung disease, and complications of lung 
transplantation; a reproductive disorder such as a disorder of prolactin production, infertility, 
including tubal disease, ovulatory defects, and endometriosis, a disruption of the estrous cycle, a 
30 disruption of the menstrual cycle, polycystic ovary syndrome, ovarian hyperstimulation syndrome, an 
endometrial or ovarian tumor, a uterine fibroid, autoimmune disorders, an ectopic pregnancy, and 
teratogenesis; cancer of the breast, fibrocystic breast disease, and galactorrhea; a disruption of 
spermatogenesis, abnormal sperm physiology, cancer of the testis, cancer of the prostate, benign 
prostatic hyperplasia, prostatitis, Peyronie's disease, impotence, carcinoma of the male breast, and 
35 gynecomastia; an immune disorder such as inflammation, actinic keratosis, acquired 

immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress syndrome, 



56 




WO 00/78953 PCT/US00/16668 

allergies, ankylosing spondylitis, amyloidosis, anemia, arteriosclerosis, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, autoimmune polyendocrinopathy- 
candidiasis-ectodermal dystrophy (APECED), bronchitis, bursitis, cholecystitis, cirrhosis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 

5 episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, atrophic 
gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, Hashimoto's 
thyroiditis, paroxysmal nocturnal hemoglobinuria, hepatitis, hypereosinophilia, irritable bowel 
syndrome, mixed connective tissue disease (MCTD), multiple sclerosis, myasthenia gravis, 
myocardial or pericardial inflammation, myelofibrosis, osteoarthritis, osteoporosis, pancreatitis, 

10 polycythemia vera, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 

Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, primary 
thrombocythemia, thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, 
complications of cancer, hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, 
parasitic, protozoal, and helminthic infections, trauma, and hematopoietic cancer including 

15 lymphoma, leukemia, and myeloma; and a cell proliferative disorder such as actinic keratosis, 
arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, mixed connective tissue disease 
(MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, polycythemia vera, psoriasis, primary 
thrombocythemia, and cancers including adenocarcinoma, leukemia, lymphoma, melanoma, 
myeloma, sarcoma, teratocarcinoma, and, in particular, cancers of the adrenal gland, bladder, bone, 

20 bone marrow, brain, breast, cervix, gall bladder, ganglia, gastrointestinal tract, heart, kidney, liver, 
lung, muscle, ovary, pancreas, parathyroid, penis, prostate, salivary glands, skin, spleen, testis, 
thymus, thyroid, and uterus. The polynucleotide sequences encoding TPPT may be used in Southern 
or northern analysis, dot blot, or other membrane-based technologies; in PCR technologies; in 
dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing fluids or tissues from 

25 patients to detect altered TPPT expression. Such qualitative or quantitative methods are well known 
in the art. 

In a particular aspect, the nucleotide sequences encoding TPPT may be useful in assays that 
detect the presence of associated disorders, particularly those mentioned above. The nucleotide 
sequences encoding TPPT may be labeled by standard methods and added to a fluid or tissue sample 

30 from a patient under conditions suitable for the formation of hybridization complexes. After a 
suitable incubation period, the sample is washed and the signal is quantified and compared with a 
standard value. If the amount of signal in the patient sample is significantly altered in comparison to a 
control sample then the presence of altered levels of nucleotide sequences encoding TPPT in the 
sample indicates the presence of the associated disorder. Such assays may also be used to evaluate 

35 the efficacy of a particular therapeutic treatment regimen in animal studies, in clinical trials, or to 
monitor the treatment of an individual patient. 
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In order to provide a basis for the diagnosis of a disorder associated with expression of TPPT, 
a normal or standard profile for expression is established. This may be accomplished by combining 
body fluids or cell extracts taken from normal subjects, either animal or human, with a sequence, or a 
fragment thereof, encoding TPPT, under conditions suitable for hybridization or amplification. 

5 Standard hybridization may be quantified by comparing the values obtained from normal subjects 
with values from an experiment in which a known amount of a substantially purified polynucleotide is 
used. Standard values obtained in this manner may be compared with values obtained from samples 
from patients who are symptomatic for a disorder. Deviation from standard values is used to establish 
the presence of a disorder. 

10 Once the presence of a disorder is established and a treatment protocol is initiated, 

hybridization assays may be repeated on a regular basis to determine if the level of expression in the 
patient begins to approximate that which is observed in the normal subject. The results obtained from 
successive assays may be used to show the efficacy of treatment over a period ranging from several 
days to months. 

15 With respect to cancer, the presence of an abnormal amount of transcript (either under- or 

overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 
development of the disease, or may provide a means for detecting the disease prior to the appearance 
of actual clinical symptoms. A more definitive diagnosis of this type may allow health professionals 
to employ preventative measures or aggressive treatment earlier thereby preventing the development 

20 or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding TPPT 
may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a polynucleotide 
encoding TPPT, or a fragment of a polynucleotide complementary to the polynucleotide encoding 

25 TPPT, and will be employed under optimized conditions for identification of a specific gene or 
condition. Oligomers may also be employed under less stringent conditions for detection or 
quantification of closely related DNA or RNA sequences. 

In a particular aspect, oligonucleotide primers derived from the polynucleotide sequences 
encoding TPPT may be used to detect single nucleotide polymorphisms (SNPs). SNPs are 

30 substitutions, insertions and deletions that are a frequent cause of inherited or acquired genetic disease 
in humans. Methods of SNP detection include, but are not limited to, single-stranded conformation 
polymorphism (SSCP) and fluorescent SSCP (fSSCP) methods. In SSCP, oligonucleotide primers 
derived from the polynucleotide sequences encoding TPPT are used to amplify DNA using the 
polymerase chain reaction (PCR). The DNA may be derived, for example, from diseased or normal 

35 tissue, biopsy samples, bodily fluids, and the like. SNPs in the DNA cause differences in the 

secondary and tertiary structures of PCR products in single-stranded form, and these differences are 
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detectable using gel electrophoresis in non-denaturing gels. In fSCCP, the oligonucleotide primers 
are fluorescently labeled, which allows detection of the amplimers in high-throughput equipment such 
as DNA sequencing machines. Additionally, sequence database analysis methods, termed in silico 
SNP (isSNP), are capable of identifying polymorphisms by comparing the sequence of individual 

5 overlapping DNA fragments which assemble into a common consensus sequence. These computer- 
based methods filter out sequence variations due to laboratory preparation of DNA and sequencing 
errors using statistical models and automated analyses of DNA sequence chromatograms. In the 
alternative, SNPs may be detected and characterized by mass spectrometry using, for example, the 
high throughput MASSARRAY system (Sequenom, Inc., San Diego CA). 

10 Methods which may also be used to quantify the expression of TPPT include radiolabeling or 

biotinylating nucleotides, coamplification of a control nucleic acid, and interpolating results from 
standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. Methods 159:235-244; Duplaa, C 
et al. (1993) Anal. Biochem. 212:229-236.) The speed of quantitation of multiple samples may be 
accelerated by running the assay in a high-throughput format where the oligomer or polynucleotide of 

15 interest is presented in various dilutions and a spectrophotometric or colorimetric response gives rapid 
quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as elements on a microarray. The microarray 
can be used in transcript imaging techniques which monitor the relative expression levels of large 

20 numbers of genes simultaneously as described in Seilhamer, J.J. et al., "Comparative Gene Transcript 
Analysis," U.S. Patent No. 5,840,484, incorporated herein by reference. The microarray may also be 
used to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, to 
monitor progression/regression of disease as a function of gene expression, and to develop and 

25 monitor the activities of therapeutic agents in the treatment of disease. In particular, this information 
may be used to develop a pharmacogenomic profile of a patient in order to select the most appropriate 
and effective treatment regimen for that patient. For example, therapeutic agents which are highly 
effective and display the fewest side effects may be selected for a patient based on his/her 
pharmacogenomic profile. 

30 In another embodiment, antibodies specific for TPPT, or TPPT or fragments thereof may be 

used as elements on a microarray. The microarray may be used to monitor or measure protein-protein 
interactions, drug-target interactions, and gene expression profiles, as described above. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, e.g., 
Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. Acad. Sci. 

35 USA 93:10614-10619; Baldeschweiler et al. (1995) PCT application W095/251 1 16; Shalon, D. et al. 
(1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. Sci. USA 94:2150- 
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2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) Various types of microarrays are well 
known and thoroughly described in DNA Microarravs: A Prac tical Approach, M. Schena, ed. (1999) 
Oxford University Press, London, hereby expressly incorporated by reference. 

In another embodiment of the invention, nucleic acid sequences encoding TPPT may be used 
5 to generate hybridization probes useful in mapping the naturally occurring genomic sequence. Either 
coding or noncoding sequences may be used, and in some instances, noncoding sequences may be 
preferable over coding sequences. For example, conservation of a coding sequence among members 
of a multi-gene family may potentially cause undesired cross hybridization during chromosomal 
mapping. The sequences may be mapped to a particular chromosome, to a specific region of a 
10 chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes (HACs), 
yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial PI 
constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) Nat. 
Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends Genet. 
7:149-154.) Once mapped, the nucleic acid sequences of the invention may be used to develop 
15 genetic linkage maps, for example, which correlate the inheritance of a disease state with the 

inheritance of a particular chromosome region or restriction fragment length polymorphism (RFLP). 
(See, e.g., Lander, E.S. and D. Botstein (1986) Proc. Natl. Acad. Sci. USA 83:7353-7357.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical and genetic 
map data. (See, e.g., Heinz-Ulrich, et al. (1995) in Meyers, supra, pp. 965-968.) Examples of genetic 
20 map data can be found in various scientific journals or at the Online Mendelian Inheritance in Man 
(OMIM) World Wide Web site. Correlation between the location of the gene encoding TPPT on a 
physical map and a specific disorder, or a predisposition to a specific disorder, may help define the 
region of DNA associated with that disorder and thus may further positional cloning efforts. 

In situ hybridization of chromosomal preparations and physical mapping techniques, such as 
25 linkage analysis using established chromosomal markers, may be used for extending genetic maps. 
Often the placement of a gene on the chromosome of another mammalian species, such as mouse, may 
reveal associated markers even if the exact chromosomal locus is not known. This information is 
valuable to investigators searching for disease genes using positional cloning or other gene discovery 
techniques. Once the gene or genes responsible for a disease or syndrome have been crudely 
30 localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 1 lq22-23, 
any sequences mapping to that area may represent associated or regulatory genes for further 
investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide sequence of 
the instant invention may also be used to detect differences in the chromosomal location due to 
translocation, inversion, etc., among normal, carrier, or affected individuals. 
35 In another embodiment of the invention, TPPT, its catalytic or immunogenic fragments, or 

oligopeptides thereof can be used for screening libraries of compounds in any of a variety of drug 
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screening techniques. The fragment employed in such screening may be free in solution, affixed to a 
solid support, borne on a cell surface, or located intracellular^ . The formation of binding complexes 
between TPPT and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of compounds 

5 having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. (1984) PCT 
application WO84/03564.) In this method, large numbers of different small test compounds are 
synthesized on a solid substrate. The test compounds are reacted with TPPT, or fragments thereof, 
and washed. Bound TPPT is then detected by methods well known in the art. Purified TPPT can also 
be coated directly onto plates for use in the aforementioned drug screening techniques. Alternatively, 

10 non-neutralizing antibodies can be used to capture the peptide and immobilize it on a solid support. 

In another embodiment, one may use competitive drug screening assays in which neutralizing 
antibodies capable of binding TPPT specifically compete with a test compound for binding TPPT. In 
this manner, antibodies can be used to detect the presence of any peptide which shares one or more 
antigenic determinants with TPPT. 

15 In additional embodiments, the nucleotide sequences which encode TPPT may be used in any 

molecular biology techniques that have yet to be developed, provided the new techniques rely on 
properties of nucleotide sequences that are currently known, including, but not limited to, such 
properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the preceding 

20 description, utilize the present invention to its fullest extent. The following preferred specific 

embodiments are, therefore, to be construed as merely illustrative, and not limitative of the remainder 
of the disclosure in any way whatsoever. 

The disclosures of all patents, applications and publications, mentioned above and below, in 
particular U.S. Ser. No. 60/139,923, U.S. Ser. No. 60/148,177, U.S. Ser. No. 60/149,357, and U.S. 

25 Ser. No. 60/162,287, are hereby expressly incorporated by reference. 

EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
30 tissues were homogenized and lysed in guanidinium isothiocyanate, while others were homogenized 
and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life Technologies), a 
monophasic solution of phenol and guanidine isothiocyanate. The resulting lysates were centrifuged 
over CsCl cushions or extracted with chloroform. RNA was precipitated from the lysates with either 
isopropanol or sodium acetate and ethanol, or by other routine methods. 
35 Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 

purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was isolated 
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using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles (QIAGEN. 
Chatsworth CA), or an OLIGOTEX mRNA purification kit (QIAGEN). Alternatively, RNA was 
isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE mRNA 
purification kit (Ambion, Austin TX). 

5 In some cases, Stratagene was provided with RNA and constructed the corresponding cDNA 

libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the UNIZAP 
vector system (Stratagene) or SUPERSCRIPT plasmid system (Life Technologies), using the 
recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 1997, supra, units 
5.1-6.6.) Reverse transcription was initiated using oligo d(T) or random primers. Synthetic 

10 oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA was digested with the 
appropriate restriction enzyme or enzymes. For most libraries, the cDNA was size-selected (300-1000 
bp) using SEPHACRYL S1000, SEPHAROSE CL2B, or SEPHAROSE CL4B column 
chromatography (Amersham Pharmacia Biotech) or preparative agarose gel electrophoresis. cDNAs 
were ligated into compatible restriction enzyme sites of the poly linker of a suitable plasmid, e.g., 

15 PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid (Life Technologies), pcDNA2. 1 plasmid 
(Invitrogen, Carlsbad CA), or pINCY plasmid (Incyte Genomics, Palo Alto CA). Recombinant 
plasmids were transformed into competent E. coli cells including XLl-Blue, XLl-BlueMRF, or 
SOLR from Stratagene or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 

II. Isolation of cDNA Clones 

20 Plasmids obtained as described in Example I were recovered from host cells by in vivo 

excision using the UNIZAP vector system (Stratagene) or by cell lysis. Plasmids were purified using 
at least one of the following: a Magic or WIZARD Minipreps DNA purification system (Promega); an 
AGTC Miniprep purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, 
QIAWELL 8 Plus Plasmid, QIAWELL 8 Ultra Plasmid purification systems or the R.E.A.L. PREP 96 

25 plasmid purification kit from QIAGEN. Following precipitation, plasmids were resuspended in 0. 1 
ml of distilled water and stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in a 
high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and thermal 
cycling steps were carried out in a single reaction mixture. Samples were processed and stored in 

30 384-well plates, and the concentration of amplified plasmid DNA was quantified fluorometrically 
using PICOGREEN dye (Molecular Probes, Eugene OR) and a FLUOROSKAN II fluorescence 
scanner (Labsystems Oy, Helsinki, Finland). 

III. Sequencing and Analysis 

Incyte cDNA recovered in plasmids as described in Example II were sequenced as follows. 
35 Sequencing reactions were processed using standard methods or high-throughput instrumentation 

such as the ABI CATALYST 800 (PE Biosystems) thermal cycler or the PTC-200 thermal cycler (MJ 
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Research) in conjunction with the HYDRA microdispenser (Robbins Scientific) or the MICROLAB 
2200 (Hamilton) liquid transfer system. cDNA sequencing reactions were prepared using reagents 
provided by Amersham Pharmacia Biotech or supplied in ABI sequencing kits such as the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). Electrophoretic 

5 separation of cDNA sequencing reactions and detection of labeled polynucleotides were carried out 
using the MEG AB ACE 1000 DNA sequencing system (Molecular Dynamics); the ABI PRISM 373 or 
377 sequencing system (PE Biosystems) in conjunction with standard ABI protocols and base calling 
software; or other sequence analysis systems known in the art. Reading frames within the cDNA 
sequences were identified using standard methods (reviewed in Ausubel, 1997, supra, unit 7.7). Some 

10 of the cDNA sequences were selected for extension using the techniques disclosed in Example VI. 

The polynucleotide sequences derived from cDNA sequencing were assembled and analyzed 
using a combination of software programs which utilize algorithms well known to those skilled in the 
art. Table 5 summarizes the tools, programs, and algorithms used and provides applicable 
descriptions, references, and threshold parameters. The first column of Table 5 shows the tools, 

15 programs, and algorithms used, the second column provides brief descriptions thereof, the third 
column presents appropriate references, all of which are incorporated by reference herein in their 
entirety, and the fourth column presents, where applicable, the scores, probability values, and other 
parameters used to evaluate the strength of a match between two sequences (the higher the score, the 
greater the homology between two sequences). Sequences were analyzed using MACDNASIS PRO 

20 software (Hitachi Software Engineering, South San Francisco CA) and LASERGENE software 

(DNASTAR). Polynucleotide and polypeptide sequence alignments were generated using the default 
parameters specified by the clustal algorithm as incorporated into the MEGALIGN multisequence 
alignment program (DNASTAR), which also calculates the percent identity between aligned 
sequences. 

25 The polynucleotide sequences were validated by removing vector, linker, and polyA 

sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then queried 
against a selection of public databases such as the GenBank primate, rodent, mammalian, vertebrate, 
and eukaryote databases, and BLOCKS, PRINTS, DOMO, PRODOM, and PFAM to acquire 

30 annotation using programs based on BLAST, FASTA, and BLIMPS. The sequences were assembled 
into full length polynucleotide sequences using programs based on Phred, Phrap, and Consed, and 
were screened for open reading frames using programs based on GeneMark, BLAST, and FASTA. 
The full length polynucleotide sequences were translated to derive the corresponding full length 
amino acid sequences, and these full length sequences were subsequently analyzed by querying 

35 against databases such as the GenBank databases (described above), SwissProt, BLOCKS, PRINTS, 
DOMO, PRODOM, Prosite, and Hidden Markov Model (HMM)-based protein family databases such 
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as PFAM. HMM is a probabilistic approach which analyzes consensus primary structures of gene 

families. (See, e.g., Eddy, S.R. (1996) Curr. Opin. Struct. Biol. 6:361-365.) 

The programs described above for the assembly and analysis of full length polynucleotide and 

amino acid sequences were also used to identify polynucleotide sequence fragments from SEQ ID 
5 NO:44-86. Fragments from about 20 to about 4000 nucleotides which are useful in hybridization and 

amplification technologies were described in The Invention section above. 

IV. Analysis of Polynucleotide Expression 

Northern analysis is a laboratory technique used to detect the presence of a transcript of a 

gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which RNAs 
10 from a particular cell type or tissue have been bound. (See, e.g., Sambrook, supra, ch. 7; Ausubel, 

1995, supra , ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or related 

molecules in cDNA databases such as GenBank or LIFESEQ (Incyte Genomics). This analysis is 

much faster than multiple membrane-based hybridizations. In addition, the sensitivity of the computer 
15 search can be modified to determine whether any particular match is categorized as exact or similar. 

The basis of the search is the product score, which is defined as: 



20 The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. The product score is a normalized value between 0 and 100, and is 
calculated as follows: the BLAST score is multiplied by the percent nucleotide identity and the 
product is divided by (5 times the length of the shorter of the two sequences). The BLAST score is 
calculated by assigning a score of +5 for every base that matches in a high-scoring segment pair 

25 (HSP), and -4 for every mismatch. Two sequences may share more than one HSP (separated by gaps). 
If there is more than one HSP, then the pair with the highest BLAST score is used to calculate the 
product score. The product score represents a balance between fractional overlap and quality in a 
BLAST alignment. For example, a product score of 100 is produced only for 100% identity over the 
entire length of the shorter of the two sequences being compared. A product score of 70 is produced 

30 either by 100% identity and 70% overlap at one end, or by 88% identity and 100% overlap at the 

other. A product score of 50 is produced either by 100% identity and 50% overlap at one end, or 79% 
identity and 100% overlap. 

The results of northern analyses are reported as a percentage distribution of libraries in which 
the transcript encoding TPPT occurred. Analysis involved the categorization of cDNA libraries by 

35 organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 



BLAST Score x Percent Identity 



5 x minimum {length(Seq. 1), length(Seq. 2)} 
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reproductive, and urologic. The disease/condition categories included cancer, inflammation, trauma, 
cell proliferation, neurological, and pooled. For each category, the number of libraries expressing the 
sequence of interest was counted and divided by the total number of libraries across all categories. 
Percentage values of tissue-specific and disease- or condition-specific expression are reported in 
5 Table 3. 

V. Chromosomal Mapping of TPPT Encoding Polynucleotides 

The cDNA sequences which were used to assemble SEQ ID NO:44-49 and SEQ ID NO:54-86 
were compared with sequences from the Incyte LIFESEQ database and public domain databases 
using BLAST and other implementations of the Smith- Waterman algorithm. Sequences from these 

10 databases that matched SEQ ID NO:44-49 and SEQ ID NO:54-86 were assembled into clusters of 
contiguous and overlapping sequences using assembly algorithms such as Phrap (Table 5). Radiation 
hybrid and genetic mapping data available from public resources such as the Stanford Human 
Genome Center (SHGC), Whitehead Institute for Genome Research (WIGR), and Genethon were 
used to determine if any of the clustered sequences had been previously mapped. Inclusion of a 

15 mapped sequence in a cluster resulted in the assignment of all sequences of that cluster, including its 
particular SEQ ID NO:, to that map location. 

The genetic map locations of SEQ ID NO:44, SEQ ID NO:48, SEQ ID NO:60, SEQ ID 
NO:65, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:71, SEQ ID NO:73, SEQ ID NO:76, SEQ ID 
NO:80,and SEQ ID NO:83 are described in The Invention as ranges, or intervals, of human 

20 chromosomes. More than one map location is reported for SEQ ID NO:65, SEQ ID NO:73, SEQ ID 
NO:80, and SEQ ID NO:83, indicating that previously mapped sequences having similarity, but not 
complete identity, to SEQ ID NO:65, SEQ ID NO:73, SEQ ID NO:80, and SEQ ID NO:83 were 
assembled into their respective clusters. The map position of an interval, in centiMorgans, is 
measured relative to the terminus of the chromosome's p-arm. (The centiMorgan (cM) is a unit of 

25 measurement based on recombination frequencies between chromosomal markers. On average, 1 cM 
is roughly equivalent to 1 megabase (Mb) of DNA in humans, although this can vary widely due to 
hot and cold spots of recombination.) The cM distances are based on genetic markers mapped by 
Genethon which provide boundaries for radiation hybrid markers whose sequences were included in 
each of the clusters. Diseases associated with the public and Incyte sequences located within the 

30 indicated intervals are also reported in the Invention section where applicable. Human genome maps 
and other resources available to the public, such as the NCBI "GeneMap*99" World Wide Web site 
(http://www.ncbi.nlm.nih.gov/genemap/), can be employed to determine if previously identified 
disease genes map within or in proximity to the intervals indicated above. 

VI. Extension of TPPT Encoding Polynucleotides 

35 The full length nucleic acid sequences of SEQ ID NO:44-86 were produced by extension of 

an appropriate fragment of the full length molecule using oligonucleotide primers designed from this 
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fragment. One primer was synthesized to initiate 5' extension of the known fragment, and the other 
primer, to initiate 3* extension of the known fragment. The initial primers were designed using 
OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 22 to 30 
nucleotides in length, to have a GC content of about 50% or more, and to anneal to the target 

5 sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which would 
result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 
extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. PCR 

10 was performed in 96-well plates using the PTC-200 thermal cycler (MJ Research, Inc.). The reaction 
mix contained DNA template, 200 nmol of each primer, reaction buffer containing Mg 2 *, (NH 4 ) 2 S0 4 , 
and P-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), ELONGASE enzyme 
(Life Technologies), and Pfu DNA polymerase (Stratagene), with the following parameters for primer 
pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 68°C, 

15 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68 °C, 5 min; Step 7: storage at 4°C. In the 
alternative, the parameters for primer pair T7 and SK+ were as follows: Step 1 : 94°C, 3 min; Step 2: 
94°C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; 
Step 6: 68°C, 5 min; Step 7: storage at 4°C 



20 quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) dissolved in IX TE 
and 0.5 ul of undiluted PCR product into each well of an opaque fluorimeter plate (Corning Costar, 
Acton MA), allowing the DNA to bind to the reagent. The plate was scanned in a Fluoroskan II 
(Labsystems Oy, Helsinki, Finland) to measure the fluorescence of the sample and to quantify the 
concentration of DNA. A 5 (A to 10 fA aliquot of the reaction mixture was analyzed by 

25 electrophoresis on a 1 % agarose mini-gel to determine which reactions were successful in extending 
the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 384-well plates, 
digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 18 vector (Amersham Pharmacia Biotech). For 

30 shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 

agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended clones 
were religated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector (Amersham 
Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fill-in restriction site 
overhangs, and transfected into competent E. coli cells. Transformed cells were selected on 

35 antibiotic-containing media, and individual colonies were picked and cultured overnight at 37 °C in 
384-well plates in LB/2x carb liquid media. 



The concentration of DNA in each well was determined by dispensing 100 ul PICOGREEN 
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The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 
(Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 
parameters: Step 1: 94°C, 3 min; Step 2; 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 

5 quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low DNA 
recoveries were reamplified using the same conditions as described above. Samples were diluted with 
20% dimethysulfoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer sequencing 
primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI PRISM 
BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). 

10 In like manner, the polynucleotide sequences of SEQ ID NO:44-86 are used to obtain 5' 

regulatory sequences using the procedure above, along with oligonucleotides designed for such 
extension, and an appropriate genomic library. 
VIL Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:44-86 are employed to screen cDNAs, 

15 genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 base 
pairs, is specifically described, essentially the same procedure is used with larger nucleotide 
fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 /iCi of 
[y- 32 Pl adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 

20 (DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 

SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). An 
aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane-based 
hybridization analysis of human genomic DNA digested with one of the following endonucleases: Ase 
I, Bgl II, Eco RI, Pst I, Xba I, or Pvu H (DuPont NEN). 

25 The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 

membranes (Nytran Plus, Schleicher & Schuell, Durham NH). Hybridization is carried out for 16 
hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room temperature 
under conditions of up to, for example, 0.1 x saline sodium citrate and 0.5% sodium dodecyl sulfate. 
Hybridization patterns are visualized using autoradiography or an alternative imaging means and 

30 compared. 

VIII. Microarrays 

The linkage or synthesis of array elements upon a microarray can be achieved utilizing 
photolithography, piezoelectric printing (ink-jet printing, See, e.g., Baldeschweiler, supra), 
mechanical microspotting technologies, and derivatives thereof. The substrate in each of the 

35 aforementioned technologies should be uniform and solid with a non-porous surface (Schena (1999), 
supra) . Suggested substrates include silicon, silica, glass slides, glass chips, and silicon wafers. 
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Alternatively, a procedure analogous to a dot or slot blot may also be used to arrange and link 
elements to the surface of a substrate using thermal, UV, chemical, or mechanical bonding 
procedures. A typical array may be produced using available methods and machines well known to 
those of ordinary skill in the art and may contain any appropriate number of elements. (See, e.g., 

5 Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645; 
Marshall, A. and J. Hodgson (1998) Nat. Biotechnol. 16:27-31.) 

Full length cDNAs, Expressed Sequence Tags (ESTs), or fragments or oligomers thereof 
may comprise the elements of the microarray. Fragments or oligomers suitable for hybridization can 
be selected using software well known in the art such as LASERGENE software (DNASTAR). The 

10 array elements are hybridized with polynucleotides in a biological sample. The polynucleotides in the 
biological sample are conjugated to a fluorescent label or other molecular tag for ease of detection. 
After hybridization, nonhybridized nucleotides from the biological sample are removed, and a 
fluorescence scanner is used to detect hybridization at each array element. Alternatively, laser 
desorbtion and mass spectrometry may be used for detection of hybridization. The degree of 

15 complementarity and the relative abundance of each polynucleotide which hybridizes to an element on 
the microarray may be assessed. In one embodiment, microarray preparation and usage is described 
in detail below. 

Tissue or Cell Sample Preparation 

Total RNA is isolated from tissue samples using the guanidinium thiocyanate method and 

20 poly(A) + RNA is purified using the oligo-(dT) cellulose method. Each poly(A) + RNA sample is 

reverse transcribed using MMLV reverse-transcriptase, 0.05 pg/|il oligo-(dT) primer (21mer), IX first 
strand buffer, 0.03 units/(il RNase inhibitor, 500 uM dATP, 500 uM dGTP, 500 uM dTTP, 40 uM 
dCTP, 40 uM dCTP-Cy3 (BDS) or dCTP-Cy5 (Amersham Pharmacia Biotech). The reverse 
transcription reaction is performed in a 25 ml volume containing 200 ng poIy(A) + RNA with 

25 GEMB RIGHT kits (Incyte). Specific control poly(A)* RNAs are synthesized by in vitro transcription 
from non-coding yeast genomic DNA. After incubation at 37 °C for 2 hr, each reaction sample (one 
with Cy3 and another with Cy5 labeling) is treated with 2.5 ml of 0.5M sodium hydroxide and 
incubated for 20 minutes at 85 °C to the stop the reaction and degrade the RNA. Samples are purified 
using two successive CHROMA SPIN 30 gel filtration spin columns (CLONTECH Laboratories, Inc. 

30 (CLONTECH), Palo Alto CA) and after combining, both reaction samples are ethanol precipitated 
using 1 ml of glycogen (1 mg/ml), 60 ml sodium acetate, and 300 ml of 100% ethanol. The sample is 
then dried to completion using a SpeedVAC (Savant Instruments Inc., Holbrook NY) and 
resuspended in 14 ul 5X SSC/0.2% SDS. 
Microarray Preparation 

35 Sequences of the present invention are used to generate array elements. Each array element 

is amplified from bacterial cells containing vectors with cloned cDNA inserts. PCR amplification 
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uses primers complementary to the vector sequences flanking the cDNA insert. Array elements are 
amplified in thirty cycles of PCR from an initial quantity of 1-2 ng to a final quantity greater than 5 
ug. Amplified array elements are then purified using SEPHACRYL-400 (Amersham Pharmacia 
Biotech). 

5 Purified array elements are immobilized on polymer-coated glass slides. Glass microscope 

slides (Corning) are cleaned by ultrasound in 0.1% SDS and acetone, with extensive distilled water 
washes between and after treatments. Glass slides are etched in 4% hydrofluoric acid (VWR 
Scientific Products Corporation (VWR), West Chester PA), washed extensively in distilled water, and 
coated with 0.05% aminopropyl silane (Sigma) in 95% ethanol. Coated slides are cured in a 1 10°C 

10 oven. 

Array elements are applied to the coated glass substrate using a procedure described in US 
Patent No. 5,807,522, incorporated herein by reference. 1 ul of the array element DNA, at an average 
concentration of 100 ng/ui, is loaded into the open capillary printing element by a high-speed robotic 
apparatus. The apparatus then deposits about 5 nl of array element sample per slide. 

15 Microarrays are UV-crosslinked using a STRATALINKER UV-crossIinker (Stratagene). 

Microarrays are washed at room temperature once in 0.2% SDS and three times in distilled water. 
Non-specific binding sites are blocked by incubation of microarrays in 0.2% casein in phosphate 
buffered saline (PBS) (Tropix, Inc., Bedford MA) for 30 minutes at 60 °C followed by washes in 0.2% 
SDS and distilled water as before. 

20 Hybridization 

Hybridization reactions contain 9 ul of sample mixture consisting of 0.2 ug each of Cy3 and 
Cy5 labeled cDNA synthesis products in 5X SSC, 0.2% SDS hybridization buffer. The sample 
mixture is heated to 65 °C for 5 minutes and is aliquoted onto the microarray surface and covered with 
an 1 .8 cm 2 coverslip. The arrays are transferred to a waterproof chamber having a cavity just slightly 
25 larger than a microscope slide. The chamber is kept at 100% humidity internally by the addition of 
140 ul of 5X SSC in a corner of the chamber. The chamber containing the arrays is incubated for 
about 6.5 hours at 60°C The arrays are washed for 10 min at 45 °C in a first wash buffer (IX SSC, 
0.1% SDS), three times for 10 minutes each at 45 °C in a second wash buffer (0.1X SSC), and dried. 
Detection 

30 Reporter-labeled hybridization complexes are detected with a microscope equipped with an 

Innova 70 mixed gas 10 W laser (Coherent, Inc., Santa Clara CA) capable of generating spectral lines 
at 488 nm for excitation of Cy3 and at 632 nm for excitation of Cy5. The excitation laser light is 
focused on the array using a 20X microscope objective (Nikon, Inc., Melville NY). The slide 
containing the array is placed on a computer-controlled X-Y stage on the microscope and raster- 

35 scanned past the objective. The 1 .8 cm x 1.8 cm array used in the present example is scanned with a 
resolution of 20 micrometers. 
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In two separate scans, a mixed gas multiline laser excites the two fluorophores sequentially. 
Emitted light is split, based on wavelength, into two photomultiplier tube detectors (PMT R1477, 
Hamamatsu Photonics Systems, Bridgewater NJ) corresponding to the two fluorophores. Appropriate 
filters positioned between the array and the photomultiplier tubes are used to filter the signals. The 
5 emission maxima of the fluorophores used are 565 nm for Cy3 and 650 nm for Cy5. Each array is 
typically scanned twice, one scan per fluorophore using the appropriate filters at the laser source, 
although the apparatus is capable of recording the spectra from both fluorophores simultaneously. 

The sensitivity of the scans is typically calibrated using the signal intensity generated by a 
cDNA control species added to the sample mixture at a known concentration. A specific location on 
10 the array contains a complementary DNA sequence, allowing the intensity of the signal at that 

location to be correlated with a weight ratio of hybridizing species of 1:100.000. When two samples 
from different sources (e.g., representing test and control cells), each labeled with a different 
fluorophore, are hybridized to a single array for the purpose of identifying genes that are differentially 
expressed, the calibration is done by labeling samples of the calibrating cDNA with the two 
15 fluorophores and adding identical amounts of each to the hybridization mixture. 

The output of the photomultiplier tube is digitized using a 12-bit RTI-835H analog-to-digital 
(A/D) conversion board (Analog Devices, Inc., Norwood MA) installed in an IBM-compatible PC 
computer. The digitized data are displayed as an image where the signal intensity is mapped using a 
linear 20-color transformation to a pseudocolor scale ranging from blue (low signal) to red (high 
20 signal). The data is also analyzed quantitatively. Where two different fluorophores are excited and 
measured simultaneously, the data are first corrected for optical crosstalk (due to overlapping 
emission spectra) between the fluorophores using each fluorophore' s emission spectrum. 

A grid is superimposed over the fluorescence signal image such that the signal from each spot 
is centered in each element of the grid. The fluorescence signal within each element is then integrated 
25 to obtain a numerical value corresponding to the average intensity of the signal. The software used 
for signal analysis is the GEMTOOLS gene expression analysis program (Incyte). 
K. Complementary Polynucleotides 

Sequences complementary to the TPPT-encoding sequences, or any parts thereof, are used to 
detect, decrease, or inhibit expression of naturally occurring TPPT. Although use of oligonucleotides 
30 comprising from about 15 to 30 base pairs is described, essentially the same procedure is used with 
smaller or with larger sequence fragments. Appropriate oligonucleotides are designed using OLIGO 
4.06 software (National Biosciences) and the coding sequence of TPPT. To inhibit transcription, a 
complementary oligonucleotide is designed from the most unique 5' sequence and used to prevent 
promoter binding to the coding sequence. To inhibit translation, a complementary oligonucleotide is 
35 designed to prevent ribosomal binding to the TPPT-encoding transcript. 
X. Expression of TPPT 
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Expression and purification of TPPT is achieved using bacterial or virus-based expression 
systems. For expression of TPPT in bacteria, cDNA is subcloned into an appropriate vector 
containing an antibiotic resistance gene and an inducible promoter that directs high levels of cDNA 
transcription. Examples of such promoters include, but are not limited to, the trp-lac (tac) hybrid 

5 promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac operator regulatory 
element. Recombinant vectors are transformed into suitable bacterial hosts, e.g., BL21(DE3). 
Antibiotic resistant bacteria express TPPT upon induction with isopropyl beta-D- 
thiogalactopyranoside (IPTG). Expression of TPPT in eukaryotic cells is achieved by infecting insect 
or mammalian cell lines with recombinant Autographica californica nuclear polyhedrosis virus 

10 (AcMNPV), commonly known as baculo virus. The nonessential polyhedrin gene of baculo virus is 
replaced with cDNA encoding TPPT by either homologous recombination or bacterial-mediated 
transposition involving transfer plasmid intermediates. Viral infectivity is maintained and the strong 
polyhedrin promoter drives high levels of cDNA transcription. Recombinant baculovirus is used to 
infect Spodoptera frugiperda (Sf9) insect cells in most cases, or human hepatocytes, in some cases. 

15 Infection of the latter requires additional genetic modifications to baculovirus. (See Engelhard, E.K. 
et al. (1994) Proc. Natl. Acad. Sci. USA 91 :3224-3227; Sandig, V. et al. (1996) Hum. Gene Ther. 
7:1937-1945.) 

In most expression systems, TPPT is synthesized as a fusion protein with, e.g., glutathione S- 
transferase (GST) or a peptide epitope tag, such as FLAG or 6-His, permitting rapid, single-step, 

20 affinity-based purification of recombinant fusion protein from crude cell lysates. GST, a 26- 
kilodalton enzyme from Schistosoma iaponicum . enables the purification of fusion proteins on 
immobilized glutathione under conditions that maintain protein activity and antigenicity (Amersham 
Pharmacia Biotech). Following purification, the GST moiety can be proteolytically cleaved from 
TPPT at specifically engineered sites. FLAG, an 8-amino acid peptide, enables immunoaffmity 

25 purification using commercially available monoclonal and polyclonal anti-FLAG antibodies (Eastman 
Kodak). 6-His, a stretch of six consecutive histidine residues, enables purification on metal-chelate 
resins (QIAGEN). Methods for protein expression and purification are discussed in Ausubel (1995, 
supra , ch. 10 and 16). Purified TPPT obtained by these methods can be used directly in the assays 
shown in Examples XI and XV. 

30 XI. Demonstration of TPPT Activity 

TPPT transport activity is assayed by measuring uptake of labeled substrates into Xenopus 
laevis oocytes. Oocytes at stages V and VI are injected with TPPT mRNA (10 ng per oocyte) and 
incubated for 3 days at 18°C in OR2 medium (82.5mM NaCl, 2.5 mM KCI, ImM CaCl 2 , lmM MgCl 2 , 
ImM Na 2 HP0 4 , 5 mM Hepes, 3.8 mM NaOH , 50ug/ml gentamycin, pH 7.8) to allow expression of 

35 TPPT. Oocytes are then transferred to standard uptake medium (lOOmM NaCl, 2 mM KCI, ImM 
CaCl 2 , ImM MgCk 10 mM Hepes/Tris pH 7.5). Uptake of various substrates (e.g., amino acids, 
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sugars, drugs, ions, and neurotransmitters) is initiated by adding labeled substrate (e.g. radiolabeled 
with 3 H, fluorescently labeled with rhodamine, etc.) to the oocytes. After incubating for 30 minutes, 
uptake is terminated by washing the oocytes three times in Na + -free medium, measuring the 
incorporated label, and comparing with controls. TPPT activity is proportional to the level of 
5 internalized labeled substrate. 
XII. Functional Assays 

TPPT function is assessed by expressing the sequences encoding TPPT at physiologically 
elevated levels in mammalian cell culture systems. cDNA is subcloned into a mammalian expression 
vector containing a strong promoter that drives high levels of cDNA expression. Vectors of choice 
10 include pCMV SPORT plasmid (Life Technologies) and pCR3. 1 plasmid (Invitrogen), both of which 
contain the cytomegalovirus promoter. 5-10 fxg of recombinant vector are transiently transfected into 
a human cell line, for example, an endothelial or hematopoietic cell line, using either liposome 
formulations or electroporation. 1-2 y,% of an additional plasmid containing sequences encoding a 
marker protein are co-transfected. Expression of a marker protein provides a means to distinguish 
15 transfected cells from nontransfected cells and is a reliable predictor of cDNA expression from the 
recombinant vector. Marker proteins of choice include, e.g., Green Fluorescent Protein (GFP; 
Clontech), CD64, or a CD64-GFP fusion protein. Flow cytometry (FCM), an automated, laser optics- 
based technique, is used to identify transfected cells expressing GFP or CD64-GFP and to evaluate 
the apoptotic state of the cells and other cellular properties. FCM detects and quantifies the uptake of 
20 fluorescent molecules that diagnose events preceding or coincident with cell death. These events 
include changes in nuclear DNA content as measured by staining of DNA with propidium iodide; 
changes in cell size and granularity as measured by forward light scatter and 90 degree side light 
scatter; down-regulation of DNA synthesis as measured by decrease in bromodeoxyuridine uptake; 
alterations in expression of cell surface and intracellular proteins as measured by reactivity with 
25 specific antibodies; and alterations in plasma membrane composition as measured by the binding of 
fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M.G. (1994) How Cytometry , Oxford, New York NY. 

The influence of TPPT on gene expression can be assessed using highly purified populations 
of cells transfected with sequences encoding TPPT and either CD64 or CD64-GFP. CD64 and CD64- 
30 GFP are expressed on the surface of transfected cells and bind to conserved regions of human 

immunoglobulin G (IgG). Transfected cells are efficiently separated from nontransfected cells using 
magnetic beads coated with either human IgG or antibody against CD64 (DYNAL, Lake Success 
NY). mRNA can be purified from the cells using methods well known by those of skill in the art. 
Expression of mRNA encoding TPPT and other genes of interest can be analyzed by northern analysis 
35 or microarray techniques. 
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XIII. Production of TPPT Specific Antibodies 

TPPT substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
Harrington, M.G. (1990) Methods Enzymol. 182:488-495), or other purification techniques, is used to 
immunize rabbits and to produce antibodies using standard protocols. 
5 Alternatively, the TPPT amino acid sequence is analyzed using LASERGENE software 

(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
selection of appropriate epitopes, such as those near the C-terminus or in hydrophilic regions are well 
described in the art. (See, e.g., Ausubel, 1995, supra, ch. 11.) 
10 Typically, oligopeptides of about 15 residues in length are synthesized using an ABI 431A 

peptide synthesizer (PE Biosystems) using FMOC chemistry and coupled to KLH (Sigma-Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 
immunogenicity. (See, e.g., Ausubel, 1995, supra.) Rabbits are immunized with the oligopeptide- 
KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide and anti- 
15 TPPT activity by, for example, binding the peptide or TPPT to a substrate, blocking with 1 % BS A, 
reacting with rabbit antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 
XIV. Purification of Naturally Occurring TPPT Using Specific Antibodies 

Naturally occurring or recombinant TPPT is substantially purified by immunoaffinity 
chromatography using antibodies specific for TPPT. An immunoaffinity column is constructed by 
20 covalently coupling anti-TPPT antibody to an activated chromatographic resin, such as 

CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

Media containing TPPT are passed over the immunoaffinity column, and the column is 
washed under conditions that allow the preferential absorbance of TPPT (e.g., high ionic strength 
25 buffers in the presence of detergent). The column is eluted under conditions that disrupt 

antibody/TPPT binding (e.g., a buffer of pH 2 to pH 3. or a high concentration of a chaotrope, such as 

urea or thiocyanate ion), and TPPT is collected. 

XV. Identification of Molecules Which Interact with TPPT 

TPPT, or biologically active fragments thereof, are labeled with '"I Bolton-Hunter reagent. 
30 (See, e.g.. Bolton A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate molecules 

previously arrayed in the wells of a multi-well plate are incubated with the labeled TPPT, washed, and 
any wells with labeled TPPT complex are assayed. Data obtained using different concentrations of 
TPPT are used to calculate values for the number, affinity, and association of TPPT with the 
candidate molecules. 

35 Alternatively, molecules interacting with TPPT are analyzed using the yeast two-hybrid 

system as described in Fields. S. and O. Song (1989, Nature 340:245-246), or using commercially 
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available kits based on the two-hybrid system, such as the MATCHMAKER system (C.on.ech). 

TPFT may also be used in the PATHCALLING process (CuraGen Corp., New Haven CD 
which employs the yeast two-hybrid system in a high-throughput manner to determine all interacts 
between the proteins encoded by two large libraries of genes (Nandabalan, K. et al. (2000) U.S. Patent 
5 No. 6,057,101). 

Various modifications and variations of the described methods and systems of the invention 
will be apparent to those skilled in the art without departing from the scope and spirit of the mvent.on. 
Although the invention has been described in connection with certain embodiments, U should be 
10 understood that the invention as claimed should not be unduly limited to such specific embod.ments. 
Indeed, various modifications of the described modes for carrying out the invention which are obvtous 
to those skilled in molecular biology or related fields are intended to be within the scope of the 
following claims. 
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What is claimed is: 

1 . An isolated polypeptide comprising an amino acid sequence selected from the group 
consisting of: 

5 a) an amino acid sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID 

NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, 
SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, 
SEQ ID NO: 17, SEQ ID NO:18, SEQ ID NO.19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, 
SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, 

10 SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, 
SEQ ID NO.35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:42, 
and SEQ ID NO:43, 

b) a naturally occurring amino acid sequence having at least 90% sequence identity to an 
amino acid sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID 

15 NO:3. SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO:10, 
SEQ ID NO:12, SEQ ID NO:13, SEQ ID NO:14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, 
SEQ ID NO:18, SEQ ID NO:19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, 
SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, 
SEQ ID NO:30, SEQ ID NO:31 , SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, 

20 SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:42, and SEQ ID 
NO:43, 

c) a biologically active fragment of an amino acid sequence selected from the group 
consisting of SEQ ID NO: 1 , SEQ ID NO:2. SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID 
NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO:12, SEQ ID NO: 13, SEQ ID 

25 NO: 14, SEQ ID NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID 
NO:20, SEQ ID NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID 
NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO.30, SEQ ID NO:31, SEQ ID 
NO:32, SEQ ID NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID 
NO:39, SEQ ID NO:41, SEQ ID NO:42, and SEQ ID NO:43, and 

30 d) an immunogenic fragment of an amino acid sequence selected from the group consisting 

of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ 
ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID 
NO: 15, SEQ ID NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID 
NO:21, SEQ ID NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID 

35 NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:31, SEQ ID NO:32. SEQ ID 
NO:33, SEQ ID NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID 
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NO:4K SEQ ID NO:42, and SEQ ID NO:43. 

2. An isolated polypeptide of claim 1 selected from the group consisting of SEQ ID NO:l, 
SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID 

5 NO:9, SEQ ID NO: 10, SEQ ID NO: 12, SEQ ID NO: 13, SEQ ID NO: 14, SEQ ID NO: 15, SEQ ID 
NO: 16, SEQ ID NO: 17, SEQ ID NO: 18, SEQ ID NO: 19, SEQ ID NO:20, SEQ ID NO:21, SEQ ID 
NO:22, SEQ ID NO:23, SEQ ID NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID 
NO:28, SEQ ID NO:29, SEQ ID NO:30, SEQ ID NO:3 1 , SEQ ID NO:32, SEQ ID NO:33, SEQ ID 
NO:34, SEQ ID NO:35, SEQ ID NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID 

10 NO:42, and SEQ ID NO:43. 

3. An isolated polynucleotide encoding a polypeptide of claim 1. 

4. An isolated polynucleotide encoding a polypeptide of claim 2. 

15 

5. An isolated polynucleotide of claim 4 selected from the group consisting of SEQ ID 
NO:44, SEQ ID NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:50, SEQ ID 
NO:51, SEQ ID NO:52, SEQ ID NO:53, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID 
NO:58, SEQ ID NO:59, SEQ ID NO:60, SEQ ID NO:61 , SEQ ID NO:62, SEQ ID NO:63, SEQ ID 

20 NO:64, SEQ ID NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID 
NO:70, SEQ ID NO:71 , SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID 
NO:76, SEQ ID NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:81, SEQ ID NO:82, SEQ ID 
NO:84, SEQ ID NO:85, SEQ ID NO:86. 

25 6. A recombinant polynucleotide comprising a promoter sequence operably linked to a 

polynucleotide of claim 3. 

7. A cell transformed with a recombinant polynucleotide of claim 6. 

30 8. A transgenic organism comprising a recombinant polynucleotide of claim 6. 

9. A method for producing a polypeptide of claim I , the method comprising: 
a) culturing a cell under conditions suitable for expression of the polypeptide, wherein said 
cell is transformed with a recombinant polynucleotide, and said recombinant polynucleotide 
35 comprises a promoter sequence operably linked to a polynucleotide encoding the polypeptide of claim 
1, and 
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b) recovering the polypeptide so expressed. 

10. An isolated antibody which specifically binds to a polypeptide of claim 1 . 

5 1 1 . An isolated polynucleotide comprising a polynucleotide sequence selected from the 

group consisting of: 

a) a polynucleotide sequence selected from the group consisting of SEQ ID NO:44, SEQ ID 
NO:45, SEQ ID NO:46, SEQ ID NO:47, SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID 
NO:52, SEQ ID NO:53, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID 
10 NO:59, SEQ ID NO:60, SEQ ID NO:61 . SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID 
NO:65, SEQ ID NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID 
NO:71. SEQ ID NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID 
NO:77, SEQ ID NO:78, SEQ ID NO:79, SEQ ID NO:8 1 , SEQ ID NO:82, SEQ ID NO:84, SEQ ID 
NO:85, SEQ ID NO:86, 

15 b) a naturally occurring polynucleotide sequence having at least 70% sequence identity to a 

polynucleotide sequence selected from the group consisting of SEQ ID NO:44, SEQ ID NO:45, SEQ 
ID NO:46, SEQ ID NO:47 t SEQ ID NO:49, SEQ ID NO:50, SEQ ID NO:51, SEQ ID NO:52, SEQ ID 
NO:53, SEQ ID NO:55, SEQ ID NO:56, SEQ ID NO:57, SEQ ID NO:58, SEQ ID NO:59, SEQ ID 
NO:60, SEQ ID NO:61 , SEQ ID NO:62, SEQ ID NO:63, SEQ ID NO:64, SEQ ID NO:65, SEQ ID 

20 NO:66, SEQ ID NO:67, SEQ ID NO:68, SEQ ID NO:69, SEQ ID NO:70, SEQ ID NO:7 1 , SEQ ID 
NO:72, SEQ ID NO:73, SEQ ID NO:74, SEQ ID NO:75, SEQ ID NO:76, SEQ ID NO:77, SEQ ID 
NO:78, SEQ ID NO:79, SEQ ID NO:81, SEQ ID NO:82, SEQ ID NO:84, SEQ ID NO:85, SEQ ID 
NO:86, 

c) a polynucleotide sequence complementary to a), 
25 d) a polynucleotide sequence complementary to b), and 

e) an RNA equivalent of a)-d). 

12. An isolated polynucleotide comprising at least 60 contiguous nucleotides of a 
polynucleotide of claim 1 1 . 

30 

13. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) hybridizing the sample with a probe comprising at least 20 contiguous nucleotides 
comprising a sequence complementary to said target polynucleotide in the sample, and which probe 
35 specifically hybridizes to said target polynucleotide, under conditions whereby a hybridization 
complex is formed between said probe and said target polynucleotide or fragments thereof, and 
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b) detecting the presence or absence of said hybridization complex, and, optionally, if 
present, the amount thereof. 

14. A method of claim 13, wherein the probe comprises at least 60 contiguous nucleotides. 



15. A method for detecting a target polynucleotide in a sample, said target polynucleotide 
having a sequence of a polynucleotide of claim 1 1, the method comprising: 

a) amplifying said target polynucleotide or fragment thereof using polymerase chain reaction 
amplification, and 



thereof, and, optionally, if present, the amount thereof. 

16. A pharmaceutical composition comprising an effective amount of a polypeptide of claim 
1 and a pharmaceutical ly acceptable excipient. 



17. A pharmaceutical composition of claim 16, wherein the polypeptide comprises an amino 
acid sequence selected from the group consisting of SEQ ID NO: 1 , SEQ ID NO:2, SEQ ID NO:3, 
SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID NO: 10, SEQ ID 
NO:12, SEQ ID NO:13, SEQ ID NO: 14, SEQ ID NO:15, SEQ ID NO:16, SEQ ID NO:17, SEQ ID 
20 NO: 1 8, SEQ ID NO: 1 9, SEQ ID NO:20, SEQ ID NO:21 , SEQ ID NO:22, SEQ ID NO:23, SEQ ID 
NO:24, SEQ ID NO:25, SEQ ID NO:26, SEQ ID NO:27, SEQ ID NO:28, SEQ ID NO:29, SEQ ID 
NO:30, SEQ ID NO:3 1, SEQ ID NO:32, SEQ ID NO:33, SEQ ID NO:34 T SEQ ID NO:35, SEQ ID 
NO:36, SEQ ID NO:38, SEQ ID NO:39, SEQ ID NO:41, SEQ ID NO:42, and SEQ ID NO:43. 

25 18. A method for treating a disease or condition associated with decreased expression of 

functional TPPT, comprising administering to a patient in need of such treatment the pharmaceutical 
composition of claim 16. 



5 



10 



b) detecting the presence or absence of said amplified target polynucleotide or fragment 



15 



1 9. A method for screening a compound for effectiveness as an agonist of a polypeptide 
30 claim K the method comprising: 



a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting agonist activity in the sample. 



20. A pharmaceutical composition comprising an agonist compound identified by a method 
35 of claim 19 and a pharmaceutical ly acceptable excipient. 



102 



PCT/US00/16668 

WO 00/78953 

21. A method for treating a disease or condition associated with decreased expression of 
functional TPPT, comprising administering to a patient in need of such treatment a pharmaceutical 
composition of claim 20. 

5 22. A method for screening a compound for effectiveness as an antagonist of a polypeptide 

of claim 1, the method comprising: 

a) exposing a sample comprising a polypeptide of claim 1 to a compound, and 

b) detecting antagonist activity in the sample. 

10 23. A pharmaceutical composition comprising an antagonist compound identified by a 

method of claim 22 and a pharmaceutically acceptable excipient. 

24. A method for treating a disease or condition associated with overexpression of functional 
TPPT, comprising administering to a patient in need of such treatment a pharmaceutical composition 

15 of claim 23. 

25. A method of screening for a compound that specifically binds to the polypeptide of claim 
1, said method comprising the steps of: 

a) combining the polypeptide of claim 1 with at least one test compound under suitable 

20 conditions, and 

b) detecting binding of the polypeptide of claim 1 to the test compound, thereby identifying a 
compound that specifically binds to the polypeptide of claim 1. 

26. A method of screening for a compound that modulates the activity of the polypeptide of 
25 claim 1 , said method comprising: 

a) combining the polypeptide of claim 1 with at least one test compound under conditions 
permissive for the activity of the polypeptide of claim 1 , 

b) assessing the activity of the polypeptide of claim 1 in the presence of the test compound, 

and 

30 c) comparing the activity of the polypeptide of claim 1 in the presence of the test compound 

with the activity of the polypeptide of claim 1 in the absence of the test compound, wherein a change 
in the activity of the polypeptide of claim 1 in the presence of the test compound is indicative of a 
compound that modulates the activity of the polypeptide of claim 1 . 

35 27. A method for screening a compound for effectiveness in altering expression of a target 

polynucleotide, wherein said target polynucleotide comprises a sequence of claim 5, the method 
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comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 

28. An isolated polynucleotide comprising a polynucleotide sequence of SEQ ID NO:83. 

29. A recombinant polynucleotide comprising a promoter sequence operably linked to a 
polynucleotide of claim 28. 

30. A cell transformed with a recombinant polynucleotide of claim 29. 

31 . A transgenic organism comprising a recombinant polynucleotide of claim 29. 

32. A method for producing a polypeptide comprising an amino acid sequence of SEQ ID 
NO:40, the method comprising: 

a) culturing the cell of claim 30 under conditions suitable for expression of the polypeptide, 

and 

b) recovering the polypeptide so expressed. 

33. A method for screening a compound for effectiveness in altering expression of a target 
polynucleotide, wherein said target polynucleotide comprises a sequence of claim 28, the method 
comprising: 

a) exposing a sample comprising the target polynucleotide to a compound, and 

b) detecting altered expression of the target polynucleotide. 
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155 








160 










165 


Arg 


Pro 


Met 


Thr 


Ala 


Lys 


Lys 


Lys 


Pro 


Asn 


Phe 


He 


Leu 


Leu 


Leu 








170 








175 










180 


Lys 


Ala 


Leu 


Leu 


Leu 


Ser 


Ser 


Tyr 


Gly 


Lys 


Leu 


Leu 


Leu 


He 


Pro 








185 










190 










195 


Ala 


Val 


He 


Trp 


Glu 


His 


Asp 


Tyr 


Thr 


Ser 


Val 


Cys 


Leu 


Lys 


Leu 








200 










205 










210 


He 


Lys 


Val 


Phe 


Val 


Leu 


Thr 


Ser 


Asn 


Phe 


Gin 


Ala 


He 


Arg 


Val 








215 










220 










225 


Thr 


Leu 


Asn 


He 


Asn 


Arg 


Lys 


Leu 


Ser 


Phe 


Leu 


Ala 


Val 


Leu 


Ser 










230 










235 










240 


Gly Leu 


Leu 


Leu 


Glu 


Ser 


He 


Met 


Val 


Tyr 


Phe 


Phe 


Gin 


Ser 


Met 










245 










250 










255 


Glu Trp Asp Val 


Gly 


Ser 


Asp 


Tyr 


Ala 


He 


Phe 


Lys 


Ser 


Gin 


Asp 










260 










265 










270 



Phe 



<210> 5 
<211> 323 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2745212CD1 



<400> 5 

Met Ala Pro Lys Gin Asp Pro Lys Pro Lys Phe Gin Glu Gly Glu 

15 10 15 

Arg Val Leu Cys Phe His Gly Pro Leu Leu Tyr Glu Ala Lys Cys 

20 25 30 

Val Lys Val Ala He Lys Asp Lys Gin Val Lys Tyr Phe He His 
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Gly 




35 










40 










45 


Tyr Ser 


Trp Asn 


Lys 


Asn 


Trp 


Asp 


Glu 


Trp 


Val 


Pro 


Glu 


Ser 








50 










55 








60 


Arg Val 


Leu 


Lys 


Tyr 


Val 


Asp 


Thr 


Asn 


Leu 


Gin 


Lys 


Gin 


Arg 


Glu 








65 










70 






75 


Leu Gin 


Lys 


Ala 


Asn 


Gin 


Glu 


Gin 


Tyr 


Ala 


Glu Gly Lys Met 


Arg 


Gly Ala 






80 










85 










90 


Ala 


Pro Gly 


Lys 


Lys 


Thr 


Ser Gly 


Leu 


Gin 


Gin 


Lys 


Asn 


Val Glu 






95 










100 








105 


Val 


Lys 


Thr 


Lys 


Lys 


Asn 


Lys 


Gin 


Lys 


Thr 


Pro Gly 


Asn 


Gly Asp 






110 










115 










120 


Gly 


Gly Ser 


Thr 


Ser 


Glu 


Thr 


Pro 


Gin 


Pro 


Pro 


Arg 


Lys 




Ala 




125 










130 








135 


Lys Arg 


Arg 


Val 


Asp 


Pro 


Thr 


Val 


Glu 


Asn 


Glu 


Glu 


Thr 


Phe 








140 










145 










150 


Met Asn 


Arg 


Val 


Glu 


Val 


Lys 


Val 


Lys 


He 


Pro 


Glu 


Glu 


Leu 


Lys 








155 










160 










165 


Pro Trp 


Leu 


Val 


Asp 


Asp 


Trp 


Asp 


Leu 


He 


Thr 


Arg 




j_>ys 


Gin 


Leu Phe 






170 










175 






180 


Tyr 


Leu 


Pro 


Ala 


Lys 


Lys 


Asn 


Val 


Asp 


Ser 


lie 


Leu 


Glu 




Ala 




185 










190 








195 


Asp Tyr 


Asn 


Tyr 


Lys 


Lys 


Ser 


Arg Gly 


Asn 


Thr 


Asp 


Asn 


Lys 


Glu Tyr 






200 










205 










210 


Ala 


Val 


Asn 


Glu 


Val 


Val 


Ala Gly 


He 


Lys 


Glu 


Tyr 


Phe 








215 










220 








225 


Asn Val 


Met 


Leu Gly 


Thr 


Gin 


Leu 


Leu 


Tyr 


Lys 


Phe 


Glu 


Arg 


Pro 


Gin Tyr 


Ala 




230 










235 








240 


Glu 


lie 


Leu 


Ala 


Asp 


His 


Pro 


Asp Ala 


Pro 


Met 


Ser 


Gin Val 






245 










250 










255 


Tyr 


Gly Ala 


Pro 


His 


Leu 


Leu 


Arg 


Leu 


Phe 


val 


Arg 


He 


Gly Ala 






260 










265 










Met 


Leu 


Ala 


Tyr 


Thr 


Pro 


Leu 


Asp 


Glu 


Lys 


Ser 


Leu 


Ala 








275 










280 








2 85 


Leu Leu 


Leu 


Asn 


Tyr 


Leu 


His 


Asp 


Phe 


Leu 


Lys 


Tyr 


Leu 


Ala 


Lys 








290 










295 








300 


Asn Ser 


Ala 


Thr 


Leu 


Phe 


Ser 


Ala 


Ser 


Asp 


Tyr 


Glu 


Val 


Ala 


Pro 








305 










310 








315 


Pro Glu 


Tyr 


His 


Arg 


Lys 


Ala 


Val 






















320 






















<210> 6 




























<211> 274 


























<212> PRT 


























<213> Homo sapiens 






















<220> 




























<221> misc_feature 






















<223> Incyte ID 


No: 


4833111CD1 
















<400> 6 




























Met Asp 


Arg 


Pro 


Gly 


Phe 


Val 


Ala 


Ala 


Leu 


Val Ala Gly Gly 


Val 


1 






5 










10 










15 


Ala Gly 


Val 


Ser 


Val 


Asp 


Leu 


He 


Leu 


Phe 


Pro 


Leu 


Asp 


Thr 


He 


Lys Thr 






20 










25 








30 


Arg 


Leu 


Gin 


Ser 


Pro 


Gin Gly 


Phe 


Ser 


Lys 


Ala 


Gly 


Gly 


Phe His 






35 










40 








45 


Gly 


lie 


Tyr 


Ala 


Gly 


Val 


Pro 


Ser 


Ala 


Ala 


He 


Gly 


Ser 


Phe Pro 






50 










55 








60 


Asn 


Ala 


Ala 


Ala 


Phe 


Phe 


He 


Thr 


Tyr 


Glu 


Tyr 


Val 


Lys 


Trp Phe 






65 










70 








75 


Leu 


His 


Ala 


Asp 


Ser 


Ser 


Ser 


Tyr 


Leu 


Thr 


Pro 


Met 


Lys 


His Met 






80 










85 










90 


Leu 


Ala 


Ala 


Ser 


Ala 


Gly Glu Val 


Val 


Ala 


Cys 


Leu 


He 


Arg Val 






95 










100 








105 


Pro 


Ser 


Glu 


Val 


Val 


Lys 


Gin 


Arg 


Ala 


Gin 


Val 


Ser 


Ala 


Ser Thr 






110 










115 










120 


Arg 


Thr 


Phe 


Gin 


lie 


Phe 


Ser 


Asn 


He 


Leu 


Tyr 


Glu 


Glu 


Gly lie 






125 










130 








135 


Gin 


Gly 


Leu 


Tyr 


Arg 


Gly 


Tyr 


Lys 


Ser 


Thr 


Val 


Leu 


Arg 
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140 






Glu 


He 


Pro 


Phe 


Ser 
155 


Leu Val 


Gin Phe 


Lys 


Ala 


Leu 


Trp 


Ser 
170 


Trp Arg 


Gin Asp 


Gin 


Ser 


Ala 


Val 


Cys 
185 


Gly Ala 


Phe Ala 


Val 


Thr 


Thr 


Pro 


Leu 


Asp Val 


Ala Lys 










200 




Lys 


Ala 


pit; 


Ser 


Ser 
215 


Thr Ala 


Asp Gly 


His 


Gly 


Val 


Trp 


Arg 


Ser Gin Gly Leu 










230 






Val 


Phe 


Pro 


Arg 


Met 
245 


Ala Ala 


He Ser 


Leu Gly 


Ala 


Tyr 


Asp 


Arg Thr 


His Ser 










260 






Arg 


Lys 


Ser 


Pro 









<210> 7 

<211> 291 

<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 876677CD1 

<400> 7 



Met 


Asp 


Ser 


Arg 


Val 


Ser 


Ser 


Pro 


Glu 


i 
i 


















Phe 


Val 


Gly 


Val 


Asn 


Asn 


Lys 


Arg 


Leu 










20 










Leu 


Phe 


Ser 


Leu 


Ser 


Phe 


Leu 


Leu 


Val 










35 










Ser 


He 


Trp 


Met 


Cys 


Leu 


Lys 


He 


He 










50 










Val 


Val 


Phe 


Arg 


Leu 


Gly 


Arg 


He 


Gin 










65 










Pro 


Gly 


Leu 


He 


Leu 


Val 


Leu 


Pro 


Cys 










80 








Val 


Asp 


Leu 


Arg 


Thr 


Val 


Thr 


Cys 


Asn 










95 










Leu 


Thr 


Arg 


Asp 


Ser 


Val 


Thr 


Thr 


Gin 










110 










Tyr 


Arg 


He 


Tyr 


Ser 


Ala 


Val 


Ser 


Ala 










125 










Val 


His 


Gin 


Ala 


Thr 


Phe 


Leu 


Leu 


Ala 










140 










Val 


Leu 


Gly 


Thr 


Gin 


Thr 


Leu 


Ser 


Gin 










155 










Glu 


He 


Ala 


His 


Ser 


He 


Gin 


Thr 


Leu 










170 










Leu 


Trp 


Gly 


He 


Arg 


Val 


Ala 


Arg Val 










185 










He 


Pro 


Val 


Gin 


Leu 


Gin 


Arg 


Ser 


Met 










200 








Thr 


Arg 


Glu 


Ala 


Arg 


Ala 


Lys 


Val 


Leu 










215 










Asn 


Ala 


Ser 


Lys 


Ser 


Leu 


Lys 


Ser 


Ala 










230 










Ser 


Pro 


He 


Ala 


Leu 


Gin 


Leu 


Arg 


Tyr 










245 








Val 


Ala 


Thr 


Glu 


Lys 


Asn 


Ser 


Thr 


He 










260 










Asn 


He 


Leu 


Glu 


Gly 


He 


Gly 


Gly Val 










275 










Lys 


Leu 


Pro 


Asn 


Lys 


Ala 










PCT/US00/16668 



145 








150 


Pro 


Leu Trp 


Glu 


Ser 


Leu 


160 








165 


His 


Val Val 


Asp 


Ser 


Trp 


175 








180 


Gly 


Gly Phe 


Ala 


Ala 


Ala 


190 








195 


Thr 


Arg He 


Thr 


Leu 


Ala 


205 








210 


Asn 


Val Leu 


Ser 


Val 


Leu 


220 








225 


Ala 


Gly Leu 


Phe 


Ala 


Gly 


235 








240 


Leu 


Gly Gly 


Phe 


He 


Phe 


250 








255 


Leu 


Leu Leu 


Glu 


Val 


Gly 


265 








270 



Lys 


Gin Asp 


Lys 


Glu 


Asn 


10 










15 


Gly 


Val 


Cys 


Gly 


Trp 


He 


25 










30 


He 


He 


Thr 


Phe 


Pro 


He 


40 










45 


Lys 


Glu 


Tyr 


Glu 


Arg 


Ala 


55 










60 


Ala 


Asp 


Lys 


Ala 


Lys 


Gly 


70 










75 


He 


Asp 


Val 


Phe 


Val 


Lys 


85 










90 


He 


Pro 


Pro 


Gin 


Glu 


lie 


100 










105 


Val 


Asp Gly Val 


Val 


Tyr 


115 










120 


Val 


Ala 


Asn 


Val 


Asn 


Asp 


130 










135 


Gin 


Thr 


Thr 


Leu 


Arg 


Asn 


145 










150 


He 


Leu Ala Gly Arg 


Glu 


160 










165 


Leu 


Asp 


Asp 


Ala 


Thr 


Glu 


175 










180 


Glu 


He 


Lys 


Asp 


Val 


Arg 


190 










195 


Ala 


Ala 


Glu 


Ala 


Glu 


Ala 


205 










210 


Ala 


Ala Glu Gly Glu 


Met 


220 










225 


Ser 


Met 


Val 


Leu 


Ala 


Glu 


235 










240 


Leu 


Gin 


Thr 


Leu 


Ser 


Thr 


250 










255 


Val 


Phe 


Pro 


Leu 


Pro 


Met 


265 










270 


Ser 


Tyr 


Asp 


Asn 


His 


Lys 


280 










285 
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290 



<210> 8 
<211> 381 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2326143CD1 

<400> 8 

Met Ser Arg His Glu Gly Val Ser Cys Asp Ala Cys Leu Lys Gly 
15 10 15 

Asn Phe Arg Gly Arg Arg Tyr Lys Cys Leu lie Cys Tyr Asp Tyr 
20 25 30 

Asp Leu Cys Ala Ser Cys Tyr Glu Ser Gly Ala Thr Thr Thr Arg 
35 40 45 

His Thr Thr Asp His Pro Met Gin Cys lie Leu Thr Arg Val Asp 
50 55 60 

Phe Asp Leu Tyr Tyr Gly Gly Glu Ala Phe Ser Val Glu Gin Pro 
65 70 75 

Gin Ser Phe Thr Cys Pro Tyr Cys Gly Lys Met Gly Tyr Thr Glu 
80 85 90 

Thr Ser Leu Gin Glu His Val Thr Ser Glu His Ala Glu Thr Ser 
95 100 105 

Thr Glu Val lie Cys Pro lie Cys Ala Ala Leu Pro Gly Gly Asp 

110 115 120 

Pro Asn His Val Thr Asp Asp Phe Ala Ala His Leu Thr Leu Glu 

125 130 135 

His Arg Ala Pro Arg Asp Leu Asp Glu Ser Ser Gly Val Arg His 

140 145 150 

Val Arg Arg Met Phe His Pro Gly Arg Gly Leu Gly Gly Pro Arg 

155 160 165 

Ala Arg Arg Ser Asn Met His Phe Thr Ser Ser Ser Thr Gly Gly 

170 175 180 

Leu Ser Ser Ser Gin Ser Ser Tyr Ser Pro Ser Asn Arg Glu Ala 

185 190 195 

Met Asp Pro lie Ala Glu Leu Leu Ser Gin Leu Ser Gly Val Arg 

200 205 210 

Arg Ser Ala Gly Gly Gin Leu Asn Ser Ser Gly Pro Ser Ala Ser 

215 220 225 

Gin Leu Gin Gin Leu Gin Met Gin Leu Gin Leu Glu Arg Gin His 

230 235 240 

Ala Gin Ala Ala Arg Gin Gin Leu Glu Thr Ala Arg Asn Ala Thr 

245 250 255 

Arg Arg Thr Asn Thr Ser Ser Val Thr Thr Thr lie Thr Gin Ser 

260 265 270 

Thr Ala Thr Thr Asn He Ala Asn Thr Glu Ser Ser Gin Gin Thr 

275 280 285 

Leu Gin Asn Ser Gin Phe Leu Leu Thr Arg Leu Asn Asp Pro Lys 

290 295 300 

Met Ser Glu Thr Glu Arg Gin Ser Met Glu Ser Glu Arg Ala Asp 

305 310 315 

Arg Ser Leu Phe Val Gin Glu Leu Leu Leu Ser Thr Leu Val Arg 

320 325 330 

Glu Glu Ser Ser Ser Ser Asp Glu Asp Asp Arg Gly Glu Met Ala 

335 340 345 

Asp Phe Gly Ala Met Gly Cys Val Asp He Met Pro Leu Asp Val 

350 355 360 

Ala Leu Glu Asn Leu Asn Leu Lys Glu Ser Asn Lys Gly Asn Glu 

365 370 375 

Pro Pro Pro Pro Pro Leu 



380 



<210> 9 
<211> 190 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 2786302CD1 

<400> 9 

Met Lys Tyr Gly Asn Glu lie Met Asn Lys Asp Pro Val Phe Arg 
15 10 15 

lie Ser Pro Arg Ser Arg Glu Thr His Pro Asn Pro Glu Glu Pro 
20 25 30 

Glu Glu Glu Asp Glu Asp Val Gin Ala Glu Arg Val Gin Ala Ala 
35 40 45 

Asn Ala Leu Thr Ala Pro Asn Leu Glu Glu Glu Pro Val lie Thr 
50 55 60 

Ala Ser Cys Leu His Lys Glu Tyr Tyr Glu Thr Lys Lys Ser Cys 
65 70 75 

Phe Ser Thr Arg Lys Lys Lys lie Ala He Arg Asn Val Ser Phe 
80 85 90 

Cys Val Lys Lys Gly Glu Val Leu Gly Leu Leu Gly His Asn Gly 
95 100 105 

Ala Gly Lys Ser Thr Ser He Lys Met He Thr Gly Cys Thr Lys 
110 115 120 

Pro Thr Ala Gly Val Val Val Leu Gin Gly Ser Arg Ala Ser Val 
125 130 135 

Arg Gin Gin His Asp Asn Ser Leu Lys Phe Leu Gly Tyr Cys Pro 
140 145 150 

Gin Glu Asn Ser Leu Trp Pro Lys Leu Thr Met Lys Glu His Leu 
155 160 165 

Glu Leu Tyr Ala Ala Val Glu Arg Leu Gly Gin Lys Arg Cys Cys 
170 175 180 

Ser Gin Tyr Phe Thr He Gly Gly Arg Ser 



<210> 10 
<211> 297 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3735780CD1 

<400> 10 

Met Met Asp Ser Glu Ala His Glu Lys Arg Pro Pro He Leu Thr 
15 10 15 

Ser Ser Lys Gin Asp He Ser Pro His He Thr Asn Val Gly Glu 
20 25 30 

Met Lys His Tyr Leu Cys Gly Cys Cys Ala Ala Phe Asn Asn Val 
35 40 45 

Ala He Thr Phe Pro He Gin Lys Val Leu Phe Arg Gin Gin Leu 
50 55 60 

Tyr Gly He Lys Thr Arg Asp Ala He Leu Gin Leu Arg Arg Asp 
65 70 75 

Gly Phe Arg Asn Leu Tyr Arg Gly He Leu Pro Pro Leu Met Gin 
80 85 90 

Lys Thr Thr Thr Leu Ala Leu Met Phe Gly Leu Tyr Glu Asp Leu 
95 100 105 

Ser Cys Leu Leu His Lys His Val Ser Ala Pro Glu Phe Ala Thr 
110 115 120 

Ser Gly Val Ala Ala Val Leu Ala Gly Thr Thr Glu Ala He Phe 
125 130 135 

Thr Pro Leu Glu Arg Val Gin Thr Leu Leu Gin Asp His Lys His 
140 145 150 

His Asp Lys Phe Thr Asn Thr Tyr Gin Ala Phe Lys Ala Leu Lys 
155 160 165 

Cys His Gly He Gly Glu Tyr Tyr Arg Gly Leu Val Pro lie Leu 
170 175 180 

Phe Arg Asn Gly Leu Ser Asn Val Leu Phe Phe Gly Leu Arg Gly 
185 190 195 

Pro He Lys Glu His Leu Pro Thr Ala Thr Thr His Ser Ala His 



185 



190 
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200 










205 










210 


Leu 


Val 


Asn 


Asp 


Phe 


He 


Cys 


Gly 


Gly Leu 


Leu 


Gly Ala 


Met 


Leu 










215 










220 










225 


Gly 


Phe 


Leu 


Phe 


Phe 


Pro 


He 


Asn 


Val 


Val 


Lys 


Thr 


Arg 


He 


Gin 










230 










235 








240 


Ser 


Gin 


He 


Gly 


Gly Glu 


Phe 


Gin 


Ser 


Phe 


Pro 


Lys 


Val 


Phe 


Gin 










245 










250 








255 


Lys 


He 


Trp 


Leu 


Glu 
260 


Arg 


Asp 


Arg 


Lys 


Leu 
265 


He 


Asn 


Leu 


Phe 


Arg 
270 


Gly Ala 


His 


Leu 


Asn 


Tyr 


His 


Arg 


Ser 


Leu 


He 


Ser 


Trp 


Gly 


He 










275 










280 








285 


He 


Asn 


Ala 


Thr 


Tyr 
290 


Glu 


Phe 


Leu 


Leu 


Lys 
295 


Val 


He 









<210> 11 
<211> 89 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc__f eature 
<223> Incyte ID No: 

<400> 11 

Met Ala Ala Gin He 
1 5 
Glu Phe Leu Gly Thr 
20 

Asp Cys Val Lys Asp 
35 

Thr Thr Cys Ser Glu 
50 

Gin Arg He Ser Met 
65 

Glu Ala Leu Ala Ala 
80 

<210> 12 
<211> 115 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



039026CD1 



Pro 


Glu 


Ser 


Asp 


Gin 
10 


Tyr 


Asn 


Lys 


Leu 


Thr 
25 


Phe 


Thr 


Thr 


Arg 


Glu 
40 


His 


Cys 


Leu 


Gin 


Lys 
55 


Arg 


Phe 


Gin 


Glu 


Tyr 
70 


Lys 


Ala 


Gly 


Leu 


Leu 
85 



260607CD1 



He 


Lys 


Gin 


Phe 


Lys 










15 


Glu 


Thr 


Cys 


Phe 


Leu 










30 


Val 


Lys 


Pro 


Glu 


Glu 










45 


Tyr 


Leu 


Lys 


Met 


Thr 










60 


His 


He 


Gin 


Gin 


Asn 










75 


Gly 


Gin 


Pro 


Arg 





<400> 12 



Met 


Ala 


Leu 


He Pro Ser Arg Val 


Trp 


Leu 


Pro 


Phe 


Ala 


val 


Trp 


1 






5 




10 










15 


Val 


Val 


Asp 


Ser Ala Pro Val Arg 
20 


Gly 


Leu 
25 


Val 


Arg 


Arg 


Glu 


Pro 
30 


Phe 


Leu 


Arg 


Thr Gly Ser Phe He 


Ala 


Leu 


Phe 


Tyr 


Phe 


Pro 


Pro 








35 




40 








45 


Leu 


Leu 


Pro 


Val Leu He Asn Leu 
50 


Phe 


Ser 
55 


Phe 


Phe 


Leu 


Thr 


Pro 
60 


Ser 


Phe 


Trp 


Arg Gin Leu Gly Ala 
65 


He 


Leu 
70 


Val 


Tyr 


Ala 


Ser 


Leu 
75 


Leu 


Ala 


Glu 


Lys Thr Pro Phe Lys 
80 


Thr 


Gin 
85 


Arg 


Thr 


Leu 


Glu 


Gly 
90 


Asp 


Ala 


Leu 


Val Gly Ser Val Ser 


He 


Phe 


Leu 


Cys 


Ala 


Lys 


Asp 








95 




100 








105 


Arg 


Gin 


Thr 


Glu Ala Glu Arg Gly 
110 


Cys 


Ser 
115 













<210> 13 
<211> 675 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 
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<223> Incyte ID No: 1429651CD1 



<400> 13 



Met Glu 


Ser 


Gly 


Thr 


Ser 


Ser 


Pro 


Gin 


Pro 


Pro 


Gin 


Leu 


Asp 


Pro 


1 






5 










10 








15 


Leu Asp 


Ala 


Phe 


Pro 


Gin 


Lys 


Gly Leu 


Glu 


Pro 


Gly 


Asp 


He 


Ala 


Val Leu 






20 










25 








30 


Val 


Leu 


Tyr 


Phe 


Leu 


Phe 


Val 


Leu 


Ala 


Val 


Gly 


Leu 


Trp 








35 










40 








45 


Ser Thr 


Val 


Lys 


Thr 


Lys 


Arg 


Asp 


Thr 


Val 


Lys 


Gly 


Tyr 


Phe 


Leu 








50 










55 










60 


Ala Gly Gly 


Asp 


Met 


Val 


Trp 


Trp 


Pro 


Val 


Gly Ala 


Ser 


Leu 


Phe 








65 










70 










75 


Ala Ser 


Asn 


Val 


Gly 


Ser 


Gly 


His 


Phe 


He 


Gly Leu Ala 


Gly 


Ser 








80 










85 








90 


Gly Ala Ala 


Thr 


Gly 


lie 


Ser 


Val 


Ser 


Ala 


Tyr 


Glu 


Leu 


Asn 


Gly 








95 










100 










105 


Leu Phe 


Ser 


Val 


Leu 


Met 


Leu 


Ala 


Trp 


He 


Phe 


Leu 


Pro 


He 


Tyr 








110 










115 










120 


lie Ala Gly 


Gin 


Val 


Thr 


Thr 


Met 


Pro 


Glu 


Tyr 


Leu 


Arg 


Lys 


Arg 








125 










130 








135 


Phe Gly Gly 


He 


Arg 


He 


Pro 


He 


He 


Leu 


Ala 


Val 


Leu 


Tyr 


Leu 


Phe He 






140 










145 








150 


Tyr 


He 


Phe 


Thr 


Lys 


He 


Ser 


Val 


Asp 


Met 


Tyr 


Ala 


Gly 








155 










160 










165 


Ala He 


Phe 


He 


Gin 


Gin 


Ser 


Leu 


His 


Leu 


Asp 


Leu 


Tyr 


Leu 


Ala 








170 










175 








180 


He Val 


Gly 


Leu 


Leu 


Ala 


He 


Thr 


Ala 


Val 


Tyr 


Thr 


Val 


Ala 


Gly 








185 










190 










195 


Gly Leu Ala 


Ala 


Val 


He 


Tyr 


Thr 


Asp 


Ala 


Leu 


Gin 


Thr 


Leu 


He 








200 










205 










210 


Met Leu 


He 


Gly 


Ala 


Leu 


Thr 


Leu 


Met 


Gly Tyr 


Ser 


Phe 


Ala 


Ala 








215 










220 










225 


Val Gly Gly 


Met 


Glu 


Gly 


Leu 


Lys 


Glu 


Lys 


Tyr 


Phe 


Leu 


Ala 


Leu 


Ala Ser 






230 










235 










240 


Asn 


Arg 


Ser 


Glu 


Asn 


Ser 


Ser 


Cys 


Gly 


Leu 


Pro 


Arg 


Glu 


Asp Ala 






245 










250 








255 


Phe 


His 


He 


Phe 


Arg 


Asp 


Pro 


Leu 


Thr 


Ser 


Asp 


Leu 


Pro 








260 










265 








270 


Trp Pro 


Gly 


Val 


Leu 


Phe 


Gly 


Met 


Ser 


He 


Pro 


Ser 


Leu 


Trp 


Tyr 








275 










280 








285 


Trp Cys 


Thr 


Asp 


Gin 


Val 


He 


Val 


Gin 


Arg 


Thr 


Leu 


Ala 


Ala 


Lys 








290 










295 










300 


Asn Leu 


Ser 


His 


Ala 


Lys 


Gly 


Gly Ala 


Leu 


Met 


Ala 


Ala 


Tyr 


Leu 








305 










310 








315 


Lys Val 


Leu 


Pro 


Leu 


Phe 


He 


Met 


Val 


Phe 


Pro Gly Met 


Val 


Ser 


Arg He 






320 










325 










330 


Leu 


Phe 


Pro 


Asp 


Gin 


Val 


Ala 


Cys 


Ala 


Asp 


Pro 


Glu 


He 








335 










340 








345 


Cys Gin 


Lys 


He 


Cys 


Ser 


Asn 


Pro 


Ser 


Gly Cys 


Ser 


Asp 


lie 


Ala 








350 










355 








360 


Tyr Pro 


Lys 


Leu 


Val 


Leu 


Glu 


Leu 


Leu 


Pro 


Thr Gly Leu Arg 


Gly 








365 










370 










375 


Leu Met 


Met 


Ala 


Val 


Met 


Val 


Ala 


Ala 


Leu 


Met 


Ser 


Ser 


Leu 


Thr 








*3 o n 










385 










390 


Ser He 


Phe 


Asn 


Ser 


Ala 


Ser 


Thr 


He 


Phe 


Thr 


Met 


Asp 


Leu 


Trp 


Asn His 






.j j j 










400 










Leu 


Arg 


Pro 


Arg 


Ala 


Ser 


Glu 


Lys 


Glu 


Leu 


Met 


lie 


Val 








410 










415 










420 


Gly Arg Val 


Phe 


Val 


Leu 


Leu 


Leu 


Val 


Leu 


Val 


Ser 


He 


Leu 


Trp 


He Pro 






425 










430 










435 


Val 


Val 


Gin 


Ala 


Ser 


Gin Gly 


Gly Gin 


Leu 


Phe 


lie 


Tyr 


He Gin 






440 










445 










450 


Ser 


He 


Ser 


Ser 


Tyr 


Leu 


Gin 


Pro 


Pro 


Val 


Ala 


Val 


Val 


Phe He 






455 










460 










465 


Met 


Gly 


Cys 


Phe 


Trp 


Lys 


Arg 


Thr 


Asn 


Glu 


Lys 


Gly 


Ala 








470 










475 








480 


Phe Trp Gly 


Leu 


He 


Ser 


Gly 


Leu 


Leu 


Leu 


Gly Leu Val 


Arg 


Leu 
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485 










490 






495 


Val 


Leu 


Asp 


Phe 


He 


Tyr 


Val 


Gin 


Pro 


Arg 


Cys Asp Gin 


Pro 


Asp 










500 










505 






510 


Glu 


Arg 


Pro 


Val 


Leu 


Val 


Lys 


Ser 


He 


His 


Tyr Leu Tyr 


Phe 


Ser 










515 










520 






525 


Met 


He 


Leu 


Ser 


Thr 


Val 


Thr 


Leu 


He 


Thr 


Val Ser Thr 


Val 


Ser 










530 










535 






540 


Trp 


Phe 


Thr 


Glu 


Pro 


Pro 


Ser 


Lys 


Glu 


Met 


Val Ser His 


Leu 


Thr 








545 










550 






555 


Trp 


Phe 


Thr 


Arg 


His 


Asp 


Pro 


Val 


Val 


Gin 


Lys Glu Gin 


Ala 


Pro 








560 










565 






570 


Pro 


Ala 


Ala 


Pro 


Leu 


Ser 


Leu 


Thr 


Leu 


Ser Gin Asn Gly Met 


Pro 










575 










580 






585 


Glu 


Ala 


Ser 


Ser 


Ser 


Ser 


Ser 


Val 


Gin 


Phe 


Glu Met Val 


Gin 


Glu 










590 










595 






600 


Asn 


Thr 


Ser 


Lvs 


Thr 


His 


Ser 


Cys 


Asp 


Met 


Thr Pro Lys 


Gin 


Ser 








605 










610 






615 


Lys 


Val 


Val 


Lys 


Ala 


He 


Leu 


Trp 


Leu 


Cys 


Gly He Gin 


Glu 


Lys 








620 










625 






630 


Gly 


Lys 


Glu 


Glu 


Leu 


Pro 


Ala 


Arg 


Ala 


Glu 


Ala He He 


Val 


Ser 






635 










640 






645 


Leu 


Glu 


Glu 


Asn 


Pro 


Leu 


Val 


Lys 


Thr 


Leu 


Leu Asp Val 


Asn 


Leu 










650 










655 






660 


lie 


Phe 


Cys 


Val 


Ser 


Cys 


Ala 


He 


Phe 


He 


Trp Gly Tyr 


Phe 


Ala 








665 










670 






675 



<210> 14 
<211> 320 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2069971CD1 

<400> 14 

Met Tyr His Cys His Ser Gly Ser Lys Pro Thr Glu Lys Gly Ala 
15 10 15 

Asn Glu Tyr Ala Tyr Ala Lys Trp Lys Leu Cys Ser Ala Ser Ala 
20 25 30 

He Cys Phe lie Phe Met He Ala Glu Val Val Gly Gly His He 
35 40 45 

Ala Gly Ser Leu Ala Val Val Thr Asp Ala Ala His Leu Leu He 
50 55 60 

Asp Leu Thr Ser Phe Leu Leu Ser Leu Phe Ser Leu Trp Leu Ser 
65 70 75 

Ser Lys Pro Pro Ser Lys Arg Leu Thr Phe Gly Trp His Arg Ala 
80 85 90 

Glu He Leu Gly Ala Leu Leu Ser He Leu Cys He Trp Val Val 
95 100 105 

Thr Gly Val Leu Val Tyr Leu Ala Cys Glu Arg Leu Leu Tyr Pro 
110 115 120 

Asp Tyr Gin lie Gin Ala Thr Val Met He He Val Ser Ser Cys 
125 130 135 

Ala Val Ala Ala Asn lie Val Leu Thr Val Val Leu His Gin Arg 
140 145 150 

Cys Leu Gly His Asn His Lys Glu Val Gin Ala Asn Ala Ser Val 
155 160 165 

Arg Ala Ala Phe Val His Ala Leu Gly Asp Leu Phe Gin Ser He 
170 175 180 

Ser Val Leu He Ser Ala Leu He He Tyr Phe Lys Pro Glu Tyr 
185 190 195 

Lys He Ala Asp Pro He Cys Thr Phe He Phe Ser He Leu Val 
200 205 210 

Leu Ala Ser Thr He Thr He Leu Lys Asp Phe Ser lie Leu Leu 
215 220 225 

Met Glu Gly Val Pro Lys Ser Leu Asn Tyr Ser Gly Val Lys Glu 
230 235 240 
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Leu 


He 


Leu. 


Ala 


val 


Asp Gly 


Val 


Leu 










1 A C 










He 


Trp 


Ser 


Leu 


Thr 


Met 


Asn 


Gin. 


val 










OCA 










Ala 


Thr 


Ala 


Ala 


tier 


Arg 


Asp 


Ser 


i^in 










o *7 c: 










Ala 


Lys 


Ala 


Leu 


Ser 


Lys 


Ser 


Phe 


Thr 










290 










Gin 


Met 


Glu 


Ser 


Pro 


Val 


Asp 


Gin 


Asp 










305 










Glu 


Asp 


Pro 


Cys 


Asp 


















320 











<210> 15 
<211> 462 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2329339CD1 

<400> 15 

Met Ala Glu Glu Gin Glu Phe Thr Gin 

1 5 
Gin Pro Ser His Pro His Cys Val Asn 
20 

Gin His Ser Gin Ala Leu Leu Arg Gly 
35 

Ser Gly He Leu Phe Asp Val Val Leu 
50 

He Glu Ala His Arg He Leu Leu Ala 
65 

Arg Gly Met Phe Ala Gly Gly Leu Lys 
80 

Val Leu He His Gly Val Ser Tyr Asn 
95 

His Phe He Tyr Thr Ser Glu Leu Glu 
110 

Gin Glu Thr Leu Val Ala Ala Cys Gin 
125 

He His Phe Cys Cys Asp Phe Leu Met 
140 

Asn lie Leu Asp Val Tyr Arg Leu Ala 
155 

Arg Leu Thr Glu Gin Leu Asp Thr Tyr 
170 

Ala Phe Ser Arg Thr Asp Lys Tyr Arg 
185 

Val Tyr Ser Leu Leu Ser Ser Asn Arg 
200 

Thr Glu Val Tyr Glu Gly Ala Leu Leu 
215 

Gin Val Gin Ala Asp Gin He Ser Leu 
230 

Leu Glu Thr Val Arg Phe Pro Leu Met 
245 

Arg Leu His Asp Lys Leu Asp Pro Ser 
260 

Ala Ser Gly Leu Met Tyr His Arg Asn 
275 

Leu Gin Ser Pro Gin Thr Glu Leu Arg 
290 

Val Gly Phe Gly Gly He His Ser Thr 
305 

Asp Gin Ala Lys Tyr Leu Asn Pro Leu 
320 

Phe Thr Ala Ser Leu Ala Pro Arg Met 
335 
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Ser 


Val 


His 


Ser 


Leu 


His 


250 










255 


He 


Leu 


Ser 


Ala 


His 


Val 


265 










270 


Val 


Val 


Arg 


Arg 


Glu 


He 


280 








285 


Met 


His 


Ser 


Leu 


Thr 


He 


295 










300 


Pro 


Asp 


Cys 


Leu 


Phe 


Cys 


310 










315 



Leu 


Cys 


Lys 


Leu 


Pro 


Ala 


10 










15 


Asn 


Thr 


Tyr 


Arg 


Ser 


Ala 


25 










30 


Leu 


Leu 


Ala 


Leu 


Arg 


Asp 


40 










45 


Val 


Val 


Glu Gly Arg 


His 


55 










60 


Ala 


Ser 


Cys 


Asp 


Tyr 


Phe 


70 










75 


Glu 


Met 


Glu 


Gin 


Glu 


Glu 


85 










90 


Ala 


Met 


Cys 


Gin 


He 


Leu 


100 










105 


Leu 


Ser 


Leu 


Ser 


Asn 


Val 


115 










120 


Leu 


Gin 


He 


Pro 


Glu 


He 


130 










135 


Ser 


Trp Val 


Asp 


Glu 


Glu 


145 










150 


Glu 


Leu 


Phe 


Asp 


Leu 


Ser 


160 










165 


He 


Leu 


Lys 


Asn 


Phe 


Val 


175 










180 


Gin 


Leu 


Pro 


Leu 


Glu 


Lys 


190 










195 


Leu 


Glu 


Val 


Ser 


Cys 


Glu 


205 










210 


Tyr 


His 


Tyr 


Ser 


Leu 


Glu 


220 










225 


His 


Glu 


Pro 


Pro 


Lys 


Leu 


235 










240 


Glu 


Ala 


Glu 


Val 


Leu 


Gin 


250 










255 


Pro 


Leu 


Arg 


Asp 


Thr 


Val 


265 










270 


Glu 


Ser 


Leu 


Gin 


Pro 


Ser 


280 










285 


Ser 


Asp 


Phe 


Gin 


Cys 


Val 


295 










300 


Pro 


Ser 


Thr 


Val 


Leu 


Ser 


310 










315 


Leu Gly Glu 


Trp 


Lys 


His 


325 










330 


Ser Asn Gin Gly 


He 


Ala 


340 










345 
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Val Leu 


Asn 


Asn 


Phe 


Val 


Tyr 


Leu 


He 








350 








Gin Gly 


Phe 


Arg 


Ala 


Glu 


Ser 


Arg 


Cys 








365 










His Asn 


Arg 


Trp 


Phe 


Gin 


He 


Gin 


Ser 








380 










Asp Leu 


Ser 


Val 


Cys 


Val 


Val 


Gly Arg 








395 










Gly Arg 


Asp 


Tyr 


His 


Asn 


Asp 


Leu 


Asn 








410 










Pro Ala 


Thr 


Asn 


Ser 


Trp 


Ala 


Tyr 


Val 








425 










Val Tyr 


Ala 


His 


Ala 


Gly 


Ala 


Thr 


Leu 








440 










Thr Cys 


Gly 


Arg 


Lys 


Leu 


He 


Pro 


Phe 








455 










<210> 16 














<211> 98 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_f eature 










<223> Incyte ID 


No: 


2540219CD1 




<400> 16 














Met Arg 


Ala 


Cys 


Ala 


Val 


Trp 


Leu 


Ala 


1 






5 










He Ser 


Trp Gly 


Thr 


Ala 


Thr 


Pro 


Met 








20 










Leu Gin 


Ala 


Asp 


Gly 


Val 


Tyr 


Leu 


Asn 








35 










Asp Cys 


He 


Ser 


Gin 


Ser 


Tyr 


Gin 


Lys 








50 










Phe Arg 


Gly 


He 


Thr 


Val 


Asn 


Ala 


Val 








65 










Ala Ala 


Met 


Phe 


Leu 


Gly 


Tyr 


Glu 


Leu 








80 










Gly Asp 


His 


Ala 


Val 


Thr 


Ser 


Pro 










95 










<210> 17 














<211> 748 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_: 


Eeature 










<223> Incyte ID 


No: 


2722462CD1 




<400> 17 














Met Asn 


Tyr 


Gin 


Glu 


Ala 


Ala 


He 


Tyr 


1 






5 










Asn Asp 


Lys 


Phe 


Phe 


Thr 


His 


Pro 


Lys 








20 








Ala Tyr 


Leu 


Phe 


Ala 


His 


Asn 


His 


Leu 








35 










Ala Thr 


Ala 


Leu 


Leu 


Leu 


Leu 


Leu 


Leu 








50 










Ala Val 


Pro 


Ala 


Leu 


Arg 


Leu 


Gly 


He 








65 








Glu Leu 


Phe 


Ala 


Leu 


Met 


Val 


Val 


Val 








80 










Leu Arg 


Trp 


Leu 


Gly 


Leu 


His 


Thr 


Phe 








95 










Met Val 


Lys 


Thr 


Ser 


Val 


Leu 


Val 


Val 








110 










Val Val 


Leu 


Val 


Arg 


Gin 


Met 


Ser 


His 








125 
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Gly 


Gly Asp Asn 


Asn 


Val 


355 






3 60 


Trp 


Arg Tyr Asp 


Pro 


Arg 


370 






375 


Leu 


Gin Gin Glu 


His 


Ala 


385 






390 


Tyr 


He Tyr Ala 


Val 


Ala 


400 






405 


Ala 


Val Glu Arg 


Tyr 


Asp 


415 






420 


Ala 


Pro Leu Lys 


Arg 


Glu 


430 






435 


Glu 


Gly Lys Met 


Tyr 


He 


445 






450 


Ser 


Glu Gly 






460 









Gly 


Gly Met 


Ala 


Gly Ala 


10 










15 


Asp 


Val 


Val 


Lys 


Ser 


Arg 


25 










30 


Lys 


Tyr 


Lys 


Gly Val 


Leu 


40 










45 


Glu 


Gly 


Leu 


Lys 


Val 


Phe 


55 










60 


Arg 


Gly 


Phe 


Pro 


Met 


Ser 


70 










75 


Ser 


Leu 


Gin 


Ala 


He 


Arg 


85 










90 



Leu 


Gin Glu Gly Glu Asn 


10 










15 


Asp 


Ala 


Lys 


Ala 


Leu 


Ala 


25 










30 


Phe 


Tyr 


Leu 


Met 


Glu 


Leu 


40 










45 


Ser 


Leu Cys 


Glu 


Ala 


Pro 


55 










60 


Tyr 


Val 


His 


Ala 


Thr 


Leu 


70 










75 


Phe 


Glu 


Leu 


Cys 


Met 


Lys 


85 










90 


lie 


Arg 


His 


Lys 


Arg 


Thr 


100 










105 


Gin 


Phe 


Val 


Glu 


Ala 


He 


115 










120 


Val 


Arg 


Val 


Thr 


Arg 


Ala 


130 










135 
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Leu 


Arg 


Cys 


He 


Phe 


Leu 


Val 


Aso 


Cvs 


Arg 


Tyr 


Cvs 


GlV 


Glv 


Val 










140 










145 










150 


Arg 


Arg 


Asn 


Leu 


Arg 


Gin 


He 


Phe 


Gin 


Ser 


Leu 


Pro 


Pro 


Phe 


Met 










155 










160 










165 


Asp 


He 


Leu 


Leu 


Leu 


Leu 


Leu 


Phe 


Phe 


Met 


He 


He 


Phe 


Ala 


He 










170 










175 










180 


Leu Gly Phe 


Tyr 


Leu 


Phe 


Ser 


Pro 


Asn 


Pro 


Ser 


Asp 


Pro 


Tyr 


Phe 










185 










190 






195 


Ser 


Thr 


Leu 


Glu 


Asn 


Ser 


He 


Val 


Ser 


Leu 


Phe 


Val 


Leu 


Leu 


Thr 










200 










205 










210 


Thr 


Ala 


Asn 


Phe 


Pro 


Asp 


Val 


Met 


Met 


Pro 


Ser 


Tyr 


Ser 


Arg 


Asn 










215 










220 










225 


Pro 


Trp 


Ser 


Cys 


Val 


Phe 


Phe 


He 


Val 


Tyr 


Leu 


Ser 


He 


Glu 


Leu 










230 










235 










240 


Tyr 


Phe 


He 


Met 


Asn 


Leu 


Leu 


Leu 


Ala 


Val 


Val 


Phe 


Asp 


Thr 


Phe 










245 










250 










255 


Asn Asp 


He 


Glu 


Lys 


Arg 


Lys 


Phe 


Lvs 


Ser 


Leu 


Leu 


Leu 


His 


Lvs 










260 










265 










270 


Arg 


Thr 


Ala 


He 


Gin 


His 


Ala 


Tyr 


Arg 


Leu 


Leu 


He 


Ser 


Gin 


Aro 










275 










280 










285 




Pro Ala Gly 


He 


Ser 


Tvr 


Arg 


Gin 


Phe 


Glu 


Gly Leu Met 


Arcr 










290 










295 










300 


It i lei 


Tyr 


Lys 


Pro 


Arg 


Met 


Ser 


Ala 


Aro 


Glu 


Arg 


Tyr 


Leu 


Thr 


Phe 










305 










310 










315 


Lys 


Ala 


Leu 


Asn 


Gin 


Asn 


Asn 


Thr 


Pro 


Leu 


Leu 


Ser 


Leu 


Lys 


Asp 










320 










325 










330 




Tyr 


Asp 


lie 


Tyr 


Glu 


Val 


Ala 


Ala 


Leu 


Lys 


Trp 


Lys 


Ala 


Lys 










335 










340 










345 


Lys 


Asn 


Arg 


Glu 


His 


Trp 


Phe 


Asp 


Glu 


Leu 


Pro Arg 


Thr 


Ala 


Leu 










350 










355 










360 


Leu 


He 


Phe 


Lys 


Gly 


He 


Asn 


He 


Leu 


Val 


Lys 


Ser 


Lys 


Ala 


Phe 










365 










370 






375 


Gin 


Tyr 


Phe 


Met 


Tvr 


Leu 


Val 


Val 


Ala 


Val 


Asn Gly Val 


Trp 


He 










380 










385 










390 


Leu 


Val 


Glu 


Thr 


Phe 


Met 


Leu 


Lys 


Gly Gly Asn 


Phe 


Phe 


Ser 


Lys 










395 










400 










405 


His 


Val 


Pro 


Trp 


Ser 


Tvr 


Leu 


Val 


Phe 


Leu 


Thr 


He 


Tyr Gly Val 










410 










415 










420 


Glu 


Leu 


Phe 


Leu 


Lys 


Val 


Ala 


Gly Leu Gly 


Pro 


Val 


Glu 


Tyr 


Leu 










425 










430 










435 


Ser 


Ser Gly 


Trp 


Asn 


Leu 


Phe 


Asp 


Phe 


Ser 


Val 


Thr 


Val 


Phe 


Ala 










440 










445 










450 


Phe 


Leu Gly 


Leu 


Leu 


Ala 


Leu 


Ala 


Leu 


Asn 


Met 


Glu 


Pro 


Phe 


Tyr 










455 










460 










465 


Phe 


He 


Val 


Val 


Leu 


Arg 


Pro 


Leu 


Gin 


Leu 


Leu 


Arg 


Leu 


Phe 


Lys 










470 










475 










480 


Leu 


Lys 


Glu 


Arg 


Tyr 


Arg 


Asn 


Val 


Leu 


Asp 


Thr 


Met 


Phe 


Glu 


Leu 










485 










490 










495 


Leu 


Pro 


Arg 


Met 


Ala 


Ser 


Leu 


Gly 


Leu 


Thr 


Leu 


Leu 


He 


Phe 


Tyr 










500 










505 










510 


Tyr 


Ser 


Phe 


Ala 


He 


Val 


Gly Met 


Glu 


Phe 


Phe 


Cys 


Gly 


He 


Val 










515 










520 










525 


Phe 


Pro 


Asn 


Cys 


Cys 


Asn 


Thr 


Ser 


Thr 


Val 


Ala 


Asp 


Ala 


Tyr 


Arg 










530 










535 










540 


Trp 


Arg 


Asn 


His 


Thr 


Val 


Gly Asn Arg Thr 


Val 


Val 


Glu 


Glu Gly 










545 










550 










555 


Tyr 


Tyr 


Tyr 


Leu 


Asn 


Asn 


Phe 


Asp Asn 


He 


Leu 


Asn 


Ser 


Phe 


Val 










560 










565 










570 


Thr 


Leu 


Phe 


Glu 


Leu 


Thr 


Val 


Val 


Asn 


Asn 


Trp Tyr 


He 


He 


Met 










575 










580 










585 


Glu Gly Val 


Thr 


Ser 


Gin 


Thr 


Ser 


His 


Trp 


Ser Arg 


Leu 


Tyr 


Phe 










590 










595 










600 


Met 


Thr 


Phe 


Tyr 


He 


Val 


Thr 


Met 


Val 


Val 


Met 


Thr 


He 


He 


Val 










605 










610 










615 


Ala 


Phe 


He 


Leu 


Glu 


Ala 


Phe 


Val 


Phe 


Arg 


Met 


Asn 


Tyr 


Ser 


Arg 










620 










625 










630 


Lys 


Asn Gin Asp 


Ser 


Glu 


Val 


Asp Gly Gly 


He 


Thr 


Leu 


Glu 


Lys 










635 










640 










645 



14/60 




WO 00/78953 



Glu lie 


Ser 


Lys 


Glu 


Glu 


Leu 


Val 


Ala 








650 










Glu Ala 


Arg 


Gly 


Ala 


Ser 


Ser 


Asp 


Val 








665 








Leu Ser 


Gin 


Met 


Glu 


Arg 


Tyr 


Gin 


Gin 








680 










Gly Arg 


Arg 


Ser 


Arg 


Thr 


Lys 


Ser 


Asp 








695 










Gin Glu 


Glu 


He 


Gin 


Glu 


Trp 


Tyr 


Glu 








710 










Glu Gin 


Gin Arg 


Gin 


Leu 


Ser 


Ser 


Ser 








725 










Gin Pro 


Pro Gly 


Ser 


Arg 


Gin 


Arg 


Ser 








740 










<210> 18 














<211> 507 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_ 


feature 










<223> Incyte ID 


No: 


2739264CD1 




<400> 18 














Met Ala 


Phe 


Asn 


Phe 


Gly 


Ala 


Pro 


Ser 


1 






5 










Ala Ala 


Thr 


Ala 


Ala 


Pro 


Ala 


Gly 


Gly 








20 






Thr Ser 


Thr 


inr 


Ala 


Gly 


Ser 


Ala 


Phe 








35 










Asn Thr 


Gly 


1 ILL 


Thr 


\j±y 


Leu 


Fne 


Gly 








50 








Phe Gly 


Phe 


uiy 


Thr 




nun 

rue 


Kj±y 


Thr 








65 










Gly Leu 


Gly 


Thr 


Gly 






rpV, „ 


Gly 








80 










Asn Thr 


Gin 


Gin 


Gin 






X LIL 


Thr 








95 










Gin Pro 


Thr 


Gin 


Ala 


Pro 


Thr 


Gin 


Ser 








110 










Ala Ser 


Ala 


Leu 


Ser 


Ala 


Pro 


Thr 


Leu 








125 










Ala He 


Leu 


Ala 


Lys 


Trp 


Asn 


Gin 


Leu 








140 










Gly Lys 


Gly 


Tyr 


Phe 


Asn 


Asn 


Asn 


He 








155 










Gin Glu 


Asn 


Pro 


Phe 


Cys 


Arg 


Phe 


Lys 








170 










Met Pro 


Ser 


Asn 


Lys 


Asp 


Glu 


Asp 


Gly 








185 










Asn Lys 


Lys 


Glu 


Thr 


Glu 


He 


Arg 


Ser 








200 










Glu Ser 


Leu 


His 


Lys 


Val 


Leu 


Gly 


Gly 








215 










Asn Val 


Glu 


Gly 


Thr 


Lys 


Thr 


Leu 


Pro 








230 










Val He 


Tyr 


Val 


Val 


Glu 


Arg 


Ser 


Pro 








245 










Val Pro 


Ala 


Thr 


Thr 


Leu 


Tyr 


Ala 


His 








260 










Lys Thr 


Gin 


Leu 


Gin 


Gin 


Leu 


Gly 


Val 








275 








Thr Glu 


Leu 


Ser 


Pro 


Ala 


Gin 


He 


Lys 








290 








Pro Ala 


Gly 


Val 


Asp 


Pro 


He 


He 


Trp 








305 










Asn Pro 


Asp 


Ser 


Glu 


Lys 


Leu 


He 


Pro 








320 












PCT/USOO/16668 



Val 


Leu 


Glu 


Leu 


Tyr 


Arg 


655 










660 


Thr 


Arg 


Leu 


Leu 


Glu 


Thr 


670 










675 


His 


Ser 


Met 


Val 


Phe 


Leu 


685 










690 


Leu 


Ser 


Leu 


Lys 


Met 


Tyr 


700 










705 


Glu 


His 


Ala 


Arg 


Glu 


Gin 


715 










720 


Ala 


Ala 


Pro 


Ala 


Ala 


Gin 


730 










735 


Gin 


Thr 


Val 


Thr 






745 












Gly 


Thr 


Ser 


Gly 


Thr 


Ala 


10 










15 


Phe 


Gly 


Gly 


Phe 


Gly 


Thr 


25 










30 


Ser 


Phe 


Ser 


Ala 


Pro 


Thr 


40 










45 


Gly 


Thr 


Gin 


Asn 


Lys 


Gly 


55 










60 


Thr 


Thr 


Gly 


Thr 


Ser 


Thr 


70 










75 


Leu 


Gly 


Phe 


Gly 


Gly 


Phe 


85 










90 


Leu 


Gly 


Gly 


Leu 


Phe 


Ser 


100 










105 


Asn 


Gin 


Leu 


He 


Asn 


Thr 


115 










120 


Leu 


Gly 


Asp 


Glu 


Arg 


Asp 


130 










135 


Gin 


Ala 


Phe 


Trp 


Gly 


Thr 


145 










150 


Pro 


Pro 


Val 


Glu 


Phe 


Thr 


160 










165 


Ala 


Val 


Gly 


Tyr 


Ser 


Cys 


175 










180 


Leu 


Val 


Val 


Leu 


Val 


Phe 


190 










195 


Gin 


Gin 


Gin 


Gin 


Leu 


Val 


205 










210 


Asn 


Gin 


Thr 


Leu 


Thr 


Val 


220 










225 


Asp 


Asp 


Gin 


Thr 


Glu 


Val 


235 










240 


Asn 


Gly 


Thr 


Ser 


Arg 


Arg 


250 










255 


Phe 


Glu 


Gin 


Ala 


Asn 


He 


265 










270 


Thr 


Leu 


Ser 


Met 


Thr 


Arg 


280 










285 


Gin 


Leu 


Leu 


Gin 


Asn 


Pro 


295 










300 


Glu 


Gin 


Ala 


Lys 


Val 


Asp 


310 










315 


Val 


Pro 


Met 


Val 


Gly 


Phe 


325 










330 



15/60 





WO 00/78953 



PCT/US00/16668 



Lys 


Glu 


Leu 


Leu 


Arg 


Arg 


Leu 


Lys 


Val Gin 


Asp 


Gin 


Met 


Thr 


Lys 










335 








340 










345 


Gin 


His 


Gin 


Thr 


Arg 


Leu 


Asp 


He 


lie Ser 


Glu 


Asp 


He 


Ser 


Glu 










350 








355 










360 


Leu 


Gin 


Lys 


Asn 


Gin 


Thr 


Thr 


Ser 


Val Ala 


Lys 


lie 


Ala 


Gin 


Tyr 










365 








370 










375 


Lys 


Arg 


Lys 


Leu 


Met 


Asp 


Leu 


Ser 


His Arg 


Thr 


Leu 


Gin 


Val 


Leu 










380 








385 










390 


lie 


Lys 


Gin 


Glu 


He 


Gin 


Arg 


Lys 


Ser Gly 


Tyr 


Ala 


lie 


Gin 


Ala 










395 








400 










405 


Asp 


Glu 


Glu 


Gin 


Leu 


Arg 


Val 


Gin 


Leu Asp 


Thr 


lie 


Gin Gly Glu 










410 








415 










420 


Leu 


Asn 


Ala 


Pro 


Thr 


Gin 


Phe 


Lys 


Gly Arg 


Leu 


Asn 


Glu 


Leu 


Met 










425 








430 










435 


Ser 


Gin 


He 


Arg 


Met 


Gin 


Asn 


His 


Phe Gly Ala Val 


Arg 


Ser 


Glu 










440 








445 










450 


Glu 


Arg 


Tyr 


Tyr 


He 


Asp 


Ala 


Asp 


Leu Leu 


Arg 


Glu 


He 


Lys 


Gin 










455 








460 










465 


His 


Leu 


Lys 


Gin 


Gin 


Gin 


Glu 


Gly 


Leu Ser 


His 


Leu 


He 


Ser 


He 










470 








475 










480 


He 


Lys 


Asp 


Asp 


Leu 


Glu 


Asp 


He 


Lys Leu 


Val 


Glu 


His 


Gly 


Leu 










485 








490 










495 


Asn 


Glu 


Thr 


He 


His 


He 


Arg 


Gly Gly Val 


Phe 


Ser 









<210> 19 
<211> 592 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2758310CD1 

<400> 19 



Met 


Trp 


Phe 


Cys 


Gly Gin 


Ser 


Thr 


Pro 


Phe 


Gly Cys 


Glu 


Leu 


His 


1 








5 










10 








15 


Asp 


Thr 


Cys 


Val 


Gin 


Leu 


Cys 


His 


Phe 


His 


Ser Ala 


Leu 


Leu 


His 










20 










25 








30 


Arg 


Arg 


Gin 


Lys 


Pro 


Trp 


Pro 


Ser 


Pro 


Ala 


Val Phe 


Phe 


Arg 


Arg 










35 










40 








45 


Asn 


Val 


Arg 


Gly 


Leu 


Pro 


Pro 


Arg 


Phe 


Ser 


Ser Pro 


Thr 


Pro 


Leu 










50 










55 








60 


Trp 


Arg 


Lys 


Val 


Leu 


Ser 


Thr 


Ala 


Val 


Val 


Gly Ala 


Pro 


Leu 


Leu 










65 










70 








75 


Leu 


Gly 


Ala 


Arg 


Tyr 


Val 


Met 


Ala 


Glu 


Ala 


Arg Glu 


Lys 


Arg 


Arg 










80 










85 








90 


Met 


Arg 


Leu 


Val 


Val 


Asp 


Gly 


Met 


Gly Arg 


Phe Gly Arg 


Ser 


Leu 










95 










100 








105 


Lys 


Val 


Gly 


Leu 


Gin 


He 


Ser 


Leu 


Asp 


Tyr 


Trp Trp 


Cys 


Thr 


Asn 










110 










115 








120 


Val 


Val 


Leu 


Arg 


Gly Trp 


Lys 


Ser 


Pro Gly 


Tyr Leu 


Glu 


Val 


Met 










125 










130 








135 


Ser 


Ala 


Cys 


His 


Gin 


Arg 


Ala 


Ala 


Asp 


Ala 


Leu Val 


Ala 


Gly Ala 










140 








145 








150 


lie 


Ser 


Asn 


Gly 


Gly Leu 


Tyr 


Val 


Lys 


Leu 


Gly Gin Gly Leu Cys 










155 










160 








165 


Ser 


Phe 


Asn 


His 


Leu 


Leu 


Pro 


Pro 


Glu 


Tyr 


Thr Arg 


Thr 


Leu 


Arg 










170 










175 








180 


Val 


Leu 


Glu 


Asp 


Arg 


Ala 


Leu 


Lys 


Arg 


Gly 


Phe Gin 


Glu 


Val 


Asp 










185 










190 








195 


Glu 


Leu 


Phe 


Leu 


Glu 


Asp 


Phe 


Gin 


Ala 


Leu 


Pro His 


Glu 


Leu 


Phe 










200 










205 








210 


Gin 


Glu 


Phe 


Asp 


Tyr 


Gin 


Pro 


lie 


Ala 


Ala 


Ala Ser 


Leu 


Ala 


Gin 










215 










220 








225 


Val 


His 


Arg 


Ala 


Lys 


Leu 


His 


Asp Gly Thr 


Ser Val 


Ala 


Val 


Lys 










230 










235 








240 


Val 


Gin 


Tyr 


lie 


Asp 


Leu 


Arg 


Asp 


Arg 


Phe 


Asp Gly Asp 


lie 


His 



500 



505 



245 



250 



255 



16/60 




WO 00/78953 PCT/US00/16668 



Thr Leu 


Glu 


Leu 


Leu 


Leu 


Arg 


Leu 


Val 


Glu 


Val 


Met 


His 


Pro 


Ser 








260 










265 










270 


Phe Gly 


Phe 


Ser 


Trp 


Val 


Leu 


Gin 


Asp 


Leu 


Lys 


Gly 


Thr 


Leu 


Ala 








275 










280 








285 


Gin Glu 


Leu 


Asp 


Phe 


Glu 


Asn 


Glu 


Gly Arg 


Asn 


Ala 


Glu 


Arg 


Cys 








290 










295 










300 


Ala Arg 


Glu 


Leu 


Ala 


His 


Phe 


Pro 


Tyr 


Val 


Val 


Val 


Pro 


Arg 


Val 








305 










310 










315 


His Ti"p 


Asp 


Lys 


Ser 


Ser 


Lys 


Arg 


Val 


Leu 


Thr 


Ala 


Asp 


Phe 


Cys 








320 










325 










330 


Ala. (j-Ly 


Cys 


Lys 


Val 


Asn 


Asp 


Val 


Glu 


Ala 


He 


Arg 


Ser 


Gin 


Gly 








335 










340 








345 


Leu Ala 


Val 


His 


Asp 


He 


Ala 


Glu 


Lys 


Leu 


He 


Lys 


Ala 


Phe 


Ala 








350 










355 










360 


Glu Gin 


He 


Phe 


Tyr 


Thr 


Gly 


Phe 


He 


His 


Ser 


Asp 


Pro 


His 


Pro 








365 










370 










375 


Gly Asn 


Val 


Leu 


Val 


Arg 


Lys 


Gly 


Pro 


Asp 


Gly 


Lys 


Ala 


Glu 


Leu 








380 










385 










390 


Val Leu 


Leu 


Asp 


His 


Gly 


Leu 


Tyr 


Gin 


Phe 


Leu 


Glu 


Glu 


Lys 


Asp 








395 










400 










405 


Arg Ala 


Ala 


Leu 


Cys 


Gin 


Leu 


Trp 


Arg 


Ala 


He 


He 


Leu 


Arg 


Asp 








410 










415 










420 


Asp Ala 


Ala 


Met 


Arg 


Ala 


His 


Ala 


Ala 


Ala 


Leu 


Gly 


Val 


Gin 


Asp 








425 










430 










435 


Tyr Leu 


Leu 


Phe 


Ala 


Glu 


Met 


Leu 


Met 


Gin 


Arg 


Pro 


Val 


Arg 


Leu 








440 










445 










450 


Gly Gin 


Leu 


Trp 


Gly 


Ser 


His 


Leu 


Leu 


Ser 


Arg 


Glu 


Glu 


Ala 


Ala 








455 










460 










465 


Tyr Met 


Val 


Asp 


Met 


Ala 


Arg 


Glu 


Arg 


Phe 


Glu 


Ala 


val 


Met 


Ala 








470 










475 










480 


Val Leu 


Arg 


Glu 


Leu 


Pro 


Arg 


Pro 


Met 


Leu 


Leu 


val 


Leu 


Arg 


Asn 








485 










490 








495 


lie Asn 


Thr 


Val 


Arg 


Ala 


He 


Asn 


Val 


Ala 


Leu 


Gly 


Ala 


Pro 


Val 








500 










505 








510 


Asp Arg 


Tyr 


Phe 


Leu 


Met 


Ala 


Lys 


Arg 


Ala 


Val 


Arg 


Gly 


Trp 


Ser 








515 










520 










525 


Arg Leu 


Ala 


Gly 


Ala 


Thr 


Tyr 


Arg 


Gly Val 


Tyr 


Gly 


Thr 


Ser 


Leu 








530 










535 










540 


Leu Arg 


His 


Ala 


Lys 


Val 


Val 


Trp 


Glu 


Met 


Leu 


Lys 


Phe 


Glu 


Val 








545 










550 










555 


Ala Leu 


Arg 


Leu 


Glu 


Thr 


Leu 


Ala 


Met 


Arg 


Leu 


Thr 


Ala 


Leu 


Leu 








560 










565 










570 


Ala Arg 


Ala 


Leu 


Val 


His 


Leu 


Ser 


Leu 


Val 


Pro 


Pro 


Ala 


Glu 


Glu 








575 










580 










585 


Leu Tyr 


Gin 


Tyr 


Leu 


Glu 


Thr 
























590 






















<210> 20 


























<211> 841 


























<212> PRT 


























<213> Homo sapiens 






















<220> 




























<221> misc_feature 






















<223> Incyte ID 


No: 


2762348CD1 
















<400> 20 


























Met Ala 


Ser 


Val 


Phe 


Arg 


Ser 


Glu 


Glu 


Met 


Cys 


Leu 


Ser 


Gin 


Leu 


1 






5 










10 








15 


Phe Leu 


Gin 


Val 


Glu 


Ala 


Ala 


Tyr 


Cys 


Cys 


Val 


Ala 


Glu 


Leu 


Gly 








20 










25 










30 


Glu Leu 


Gly 


Leu 


Val 


Gin 


Phe 


Lys 


Asp 


Leu 


Asn 


Met 


Asn 


Val 


Asn 








35 










40 










45 


Ser Phe 


Gin 


Arg 


Lys 


Phe 


Val 


Asn 


Glu 


Val 


Arg 


Arg 


Cys 


Glu 


Ser 








50 










55 










60 


Leu Glu 


Arg 


He 


Leu 


Arg 


Phe 


Leu 


Glu 


Asp 


Glu 


Met 


Gin 


Asn 


Glu 








65 










70 










75 


He Val 


Val 


Gin 


Leu 


Leu 


Glu 


Lys 


Ser 


Pro 


Leu 


Thr 


Pro 


Leu 


Pro 








80 










85 










90 



17/60 




WO 00/78953 PCTAJS00/16668 



Arg 


Glu 


Met 


He 


Thr 


Leu 


Glu 


Thr 


Val 


Leu 


Glu 


Lys 


Leu 


Glu 


Gly 










95 










100 








105 


blU 


Leu 


Gin 


Glu 


Ala 


Asn 


Gin 


Asn 


Gin 


Gin 


Ala 


Leu 


Lys 


Gin 


Ser 










110 










115 










120 


rTie 


Leu 


GlU 


Leu 


Thr 


Glu 


Leu 


Lys 


Tyr 


Leu 


Leu 


Lys 


Lys 


Thr 


Gin 










125 










130 








135 


Asp 


Phe 


Phe 


Glu 


Thr 


Glu 


Thr 


Asn 


Leu 


Ala 


Asp 


Asp 


Phe 


Phe 


Thr 










140 










145 






150 


GlU 


Asp 


Thr 


Ser 


Gly 


Leu 


Leu 


Glu 


Leu 


Lys 


Ala 


Val 


Pro 


Ala 


Tyr 










155 










160 










165 


Met 


Tnx 


Gly 


Lys 


Leu 


Gly 


Phe 


He 


Ala 


Gly 


Cys 


Asp 


Pro 


Thr 


Gly 










170 










175 










180 


Lys 


Arg 


Met 


Ala 


Ser 


Phe 


Glu 


Arg 


Leu 


Leu 


Trp 


Arg 


Val 


Cys 


Arg 










185 










190 










195 


Gly 


Asn 


Val 


Tyr 


Leu 


Lys 


Phe 


Ser 


Glu 


Met 


Asp 


Ala 


Pro 


Leu 


Glu 










200 










205 








210 


Asp 


Pro 


Val 


Thr 


Lys 


Glu 


Glu 


lie 


Gin 


Lys 


His 


He 


Phe 


lie 


He 










215 










220 










225 


Phe 


Tyr 


Gin 


Gly 


Glu 


Gin 


Leu 


Arg 


Gin 


Lys 


He 


Lys 


Lys 


lie 


Cys 










230 










235 










240 


Asp 


Gly 


Phe 


Arg 


Ala 


Thr 


Val 


Tyr 


Pro 


Cys 


Pro 


Glu 


Pro 


Ala 


Val 










245 










250 










255 


Glu 


Arg 


Arg 


Glu 


Met 


Leu 


Glu 


Ser 


Val 


Asn 


Val 


Arg 


Leu 


Glu 


Asp 










260 










265 










270 


Leu 


He 


Thr 


Val 


He 


Thr 


Gin 


Thr 


Glu 


Ser 


His 


Arg 


Gin 


Arg 


Leu 










275 










280 








285 


Leu 


Gin 


Glu 


Ala 


Ala 


Ala 


Asn 


Trp 


His 


Ser 


Trp 


Leu 


He 


Lys 


Val 










290 










295 








300 


Gin 


Lys 


Met 


Lys 


Ala 


Val 


Tyr 


His 


lie 


Leu 


Asn 


Met 


Cys 


Asn 


lie 










305 










310 










315 


Asp 


Val 


Thr 


Gin 


Gin 


Cys 


Val 


He 


Ala 


Glu 


lie 


Trp 


Phe 


Pro 


Val 










320 










325 








330 


Ala 


Asp 


Ala 


Thr 


Arg 


He 


Lys 


Arg 


Ala 


Leu 


Glu 


Gin 


Gly 


Met 


Glu 










335 










340 








345 


Leu 


Ser 


Gly 


Ser 


Ser 


Met 


Ala 


Pro 


He 


Met 


Thr 


Thr 


Val 


Gin 


Ser 










350 










355 










360 


Lys 


Thr 


Ala 


Pro 


Pro 


Thr 


Phe 


Asn 


Arg 


Thr 


Asn 


Lys 


Phe 


Thr 


Ala 










365 










370 








375 


Gly 


Phe 


Gin 


Asn 


lie 


Val 


Asp 


Ala 


Tyr 


Gly 


Val 


Gly 


Ser 


Tyr 


Arg 










380 










385 










390 


Glu 


He 


Asn 


Pro 


Ala 


Pro 


Tyr 


Thr 


lie 


He 


Thr 


Phe 


Pro 


Phe 


Leu 










395 










400 










405 


Phe 


Ala 


Val 


Met 


Phe 


Gly 


Asp 


Cys 


Gly 


His 


Gly 


Thr 


Val 


Met 


Leu 










410 










415 










420 


Leu 


Ala 


Ala 


Leu 


Trp 


Met 


He 


Leu 


Asn 


Glu 


Arg 


Arg 


Leu 


Leu 


Ser 










425 










430 










435 


Gin 


Lys 


Thr 


Asp 


Asn 


Glu 


He 


Trp 


Asn 


Thr 


Phe 


Phe 


His 


Gly 


Arg 










440 










445 








450 


Tyr 


Leu 


He 


Leu 


Leu 


Met 


Gly 


lie 


Phe 


Ser 


lie 


Tyr 


Thr 


Gly 


Leu 










455 










460 






465 


lie 


Tyr 


Asn 


Asp 


Cys 


Phe 


Ser 


Lys 


Ser 


Leu 


Asn 


lie 


Phe 


Gly 


Ser 










470 










475 








480 


Ser 


Trp 


Ser 


Val 


Gin 


Pro 


Met 


Phe 


Arg 


Asn 


Gly 


Thr 


Trp 


Asn 


Thr 


nx o 








485 










490 








495 


vai 


net. 


GlU 


Glu 


Ser 


Leu 


Tyr 


Leu 


Gin 


Leu 


Asp 


Pro 


Ala 


He 










500 










505 








510 


Pro 


Gly 


val 


Tyr 


Pne 


Gly 


Asn 


Pro 


Tyr 


Pro 


Phe 


Gly 


lie 


Asp 


Pro 










515 










520 






525 


He 


Tro 


Asn 


Leu 


Ala 


Ser 


Asn 


JLys 


Leu 


1 1 IX 




Leu 


Asn 




Tyr 










530 










535 










540 


Lys 


Met 


Lys 


Met 


Ser 


Val 


He 


Leu 


Gly 


He 


Val 


Gin 


Met 


Val 


Phe 










545 










550 










555 


Gly 


Val 


He 


Leu 


Ser 


Leu 


Phe 


Asn 


His 


lie 


Tyr 


Phe 


Arg 


Arg 


Thr 










560 










565 










570 


Leu 


Asn 


He 


He 


Leu 


Gin 


Phe 


lie 


Pro 


Glu 


Met 


lie 


Phe 


lie 


Leu 










575 










580 










585 


Cys 


Leu 


Phe 


Gly 


Tyr 


Leu 


Val 


Phe 


Met 


He 


He 


Phe 


Lys 


Trp 


Cys 



590 595 600 
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Glu 



605 
. Met 
620 
Lys 
635 
He 
650 
Ala 
665 
Asp 
680 
Arg 
695 
Asp 
710 
Ala 
725 
Ala 
740 
Leu 
755 
Thr 
770 
Val 
785 
Leu 
800 
Gin 
815 
Ser 
830 



Val 


Ser 


Gin His Ala 


Pro 


Ser He Leu He 


Phe 




610 




615 


Leu 


Phe Asn Tyr 


Ser 


Asp Ser Ser Asn 


His 




625 




630 


Gin 


Gin Glu Val 


Gin 


Ser Phe Phe Val 






640 




645 


Ser 


Val 


Pro Trp Met 


Leu 


Leu lie Lys Pro 


Ser 


His 


655 




660 


Arg Lys Ser 


Gin 


Leu Gin Ala Ser 


Ala 




670 




675 


Thr 


Glu Asn He 


Glu Gly Asp Ser Ser 






685 




690 


Ser 


Gly Gin Arg Thr 


Ser 


Ala Asp Thr His 


His 




700 




705 


Gly Glu Glu Phe 


Asn 


Phe Gly Asp Val 


He 




715 




720 


His 


Thr lie Glu 


Tyr 


Cys Leu Gly Cys 






730 




735 


Ser 


Tyr 


Leu Arg Leu 


Trp 


Ala Leu Ser Leu 


Ser 


Glu 


745 




750 


Val Leu Trp Thr Met Val Met Asn 






760 




765 


Arg 


Gly Trp Gly Gly He 


Val Gly Val Phe 






775 




780 


Phe 


Ala 


Val Leu Thr 


Val 


Ala He Leu Leu 






790 




795 


Ser 


Ala 


Phe Leu His 


Ala 


Leu Arg Leu His 






805 




810 


Asn 


Lys 


Phe Tyr Val 


Gly Asp Gly Tyr Lys 


Phe 




820 




825 


Lys 


His He Leu 


Asp 


Gly Thr Ala Glu 






835 




840 



<210> 21 
<211> 253 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3715961CD1 

<400> 21 
Met < 
1 



Glu 


Cys 


Pro 


Leu 


He 


Leu 


Tyr 


lie His Lys 


His 


He 


Asp 


Ser 


Gin 


5 










10 






15 


Ser 


Tyr 


Leu 


Phe 


Asn 


Asp Leu Phe 


Tyr 


Pro 


Val 


Gly 


Gly 


20 










25 




30 


Arg 


Met 


Val 


Thr 


Tyr 


Glu His Leu 


Arg 


Glu 


Val 


Gly 




35 










40 




45 


Lys 


Ser 


Glu 


Asp 


Glu 


His 


Tyr Pro Leu 


Trp 


Lys 


Ser 


Gly 




50 










55 


60 


Gly 


Met 


Met 


Ala 


Gly Val 


He Gly Gin 


Phe 


Leu 


Ala 


Thr 




65 










70 






75 


Asp 


Leu 


Val 


Lys 


Val 


Gin 


Met Gin Met 


Glu 


Gly Lys 


Leu 


Glu 


80 










85 






90 


Gly Lys 


Pro 


Leu 


Arg Phe Arg Gly 


Val 


His 


His 


Ala 




95 










100 






105 


Lys 


He 


Leu 


Ala 


Glu Gly Gly He Arg 


Gly Leu Trp 


Trp 


Val 


110 










115 






120 


Pro 


Asn 


He 


Gin 


Arg 


Ala Ala Leu 


Val 


Asn 


Met 


Leu 


Thr 


125 










130 






135 


Thr 


Tyr 


Asp 


Thr 


Val 


Lys His Tyr 


Leu 


Val 


Leu 


Pro 




140 










145 






150 


Leu 


Glu 


Asp 


Asn 


He 


Met 


Thr His Gly 


Leu 


Ser 


Ser 


Ser 




155 










160 






165 


Gly 


Leu 
170 


Val 


Ala 


Ser 


He 


Leu Gly Thr 
175 


Pro 


Ala 


Asp 
180 
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Val lie Lys Ser Arg lie Met Asn Gin Pro Arg Asp Lys Gin Gly 

185 190 195 

Arg Gly Leu Leu Tyr Lys Ser Ser Thr Asp Cys Leu He Gin Ala 

200 205 210 

Val Gin Gly Glu Gly Phe Met Ser Leu Tyr Lys Gly Phe Leu Pro 

215 220 225 

Ser Trp Leu Arg Met Thr Pro Trp Ser Met Val Phe Trp Leu Thr 

230 235 240 

Tyr Glu Lys He Arg Glu Met Ser Gly Val Ser Pro Phe 

245 250 

<210> 22 
<211> 229 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 5108194CD1 

<400> 22 

Met Gly Asn Gly Val Lys Glu Gly Pro Val Arg Leu His Glu Asp 

15 10 15 

Ala Glu Ala Val Leu Ser Ser Ser Val Ser Ser Lys Arg Asp His 

20 25 30 

Arg Gin Val Leu Ser Ser Leu Leu Ser Gly Ala Leu Ala Gly Ala 

35 40 45 

Leu Ala Lys Thr Ala Val Ala Pro Leu Asp Arg Thr Lys He He 

50 55 60 

Phe Gin Val Ser Ser Lys Arg Phe Ser Ala Lys Glu Ala Phe Arg 

65 70 75 

Val Leu Tyr Tyr Thr Tyr Leu Asn Glu Gly Phe Leu Ser Leu Trp 

80 85 90 

Arg Gly Asn Ser Ala Thr Met Val Arg Val Val Pro Tyr Ala Ala 

95 100 105 

He Gin Phe Ser Ala His Glu Glu Tyr Lys Arg He Leu Gly Ser 

110 115 120 

Tyr Tyr Gly Phe Arg Gly Glu Ala Leu Pro Pro Trp Pro Arg Leu 

125 130 135 

Phe Ala Gly Ala Leu Ala Gly Thr Thr Ala Ala Ser Leu Thr Tyr 

140 145 150 

Pro Leu Asp Leu Val Arg Ala Arg Met Ala Val Thr Pro Lys Glu 

155 160 165 

Met Tyr Ser Asn He Phe His Val Phe He Arg He Ser Arg Glu 

170 175 180 

Glu Gly Leu Lys Thr Leu Tyr His Gly Phe Met Pro Thr Val Leu 

185 190 195 

Gly Val He Pro Tyr Ala Gly Leu Ser Phe Phe Thr Tyr Glu Thr 

200 205 210 

Leu Lys Ser Leu His Arg Glu Tyr Ser Gly Arg Lys Leu He Pro 

215 220 225 

Phe Ser Glu Gly 

<210> 23 
<211> 170 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 5503122CD1 

<400> 23 

Met Tyr Asp Asn Leu Tyr Leu His Gly lie Glu Asp Ser Glu Ala 

15 10 15 

Gly Ser Ala Asp Ser Tyr Thr Ser Arg Pro Ser Asp Ser Asp Val 

20 25 30 

Ser Leu Glu Glu Asp Arg Glu Ala He Arg Gin Glu Arg Glu Gin 

35 40 45 
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Gin 


Ala 


Ala 


He 


Gin Leu 


Glu 


Arg 


Ala 










50 






Phe 


Ala 


Val 


Lys 


Thr Asn 


Val 


Ser 


Tyr 










65 








Asp 


Val 


Pro 


Val 


Pro Ser 


Thr 


Ala 


He 










80 








Phe 


Leu 


His 


He 


Lys Glu 
95 


Lys 


Tyr 


Asn 


Arg 


Leu 


Val 


Lys 


Glu Gly 


Cys 


Glu 


lie 










110 








Leu 


Arg 


Leu 


Glu 


Asn He 


Arg 


He 


Gin 










125 








Arg 


Phe 


His 


Gly 


Gly Lys 


Ser 


Ser Gly 










140 








Gly 


Glu 


Met 


Val 


Ser Gly Thr 


Phe 


Arg 










155 








Gly 


Glu 


Gly Cys 


Ser 
















170 









<210> 24 
<211> 655 
<212> PRT 

<213> Homo sapiens 




PCT/US00/I6668 



Lys 


Ser 


Lys 


Pro 


Val 


Ala 


55 










60 


Cys 


Gly Ala 


Leu 


Asp 


Glu 


70 










75 


Ser 


Phe 


Asp 


Ala 


Lys 


Asp 


85 










90 


Asn 


Asp 


Trp 


Trp 


He 


Gly 


100 










105 


Gly 


Phe 


He 


Pro 


Ser 


Pro 


115 










120 


Gin 


Glu 


Gin 


Lys 


Arg 


Gly 


130 










135 


Asn 


Ser 


Ser 


Ser 


Ser 


Leu 


145 










150 


Ala 


Thr 


Pro 


Thr 


Ser 


Thr 


160 










165 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 5517972CD1 



<400> 24 








Met 


Ser 


Ser 


Ser 


Asn Val 


Glu 


1 








5 




Asn 


Thr 


Asn 


Gly 


Phe Pro 
20 


Ala 


Phe 


Thr 


Glu 


Gly 


Ala Val 
35 


Leu 


Val 


Lys 


Leu 


Lys 


Ser Gly 
50 


Phe 


Lys 


Glu 


He 


Leu 


Ser Asn 
65 


He 


Asn 


Ala 


He 


Leu 


Gly Pro 
80 


Thr 


Asp 


Val 


Leu 


Ala 


Ala Arg 
95 


Lys 


Val 


Leu 


He 


Asn 


Gly Ala 
110 


Pro 


Ser 


Gly 


Tyr 


Val 


Val Gin 
125 


Asp 


Val 


Arg 


Glu 


Asn 


Leu Gin 
140 


Phe 


Thr 


Met 


Thr 


Asn 


His Glu 
155 


Lys 


Gin 


Glu 


Leu 


Gly 


Leu Asp 
170 


Lys 


Gin 


Phe 


He 


Arg 


Gly Val 
185 


Ser 


He 


Gly 


Met 


Glu 


Leu He 
200 


Thr 


Glu 


Pro 


Thr 


Thr 


Gly Leu Asp 










215 




Leu 


Leu 


Leu 


Lys 


Arg Met 
230 


Ser 


Ser 


He 


His 


Gin 


Pro Arg 
245 


Tyr 


Leu 


Thr 


Leu 


Leu 


Ala Ser Gly 










260 




Gin 


Glu 


Ala 


Leu 


Gly Tyr 
275 


Phe 


Ala 


Tyr 


Asn 


Asn 


Pro Ala 
290 


Asp 



Val 


Phe 


He 


Pro 


Val 


Ser 


Gin 


Gly 






10 










15 


Thr 


Ala 


Ser 


Asn Asp 


Leu 


Lys 


Ala 






25 










30 


Ser 


Phe 


His 


Asn 


He 


Cys 


Tyr 


Arg 






40 










45 


Leu 


Pro 


Cys 


Arg 


Lys 


Pro 


Val 


Glu 






55 










60 


Asn Gly 


He 


Met 


Lys 


Pro 


Gly 


Leu 






70 










75 


Gly Gly Gly Lys 


Ser 


Ser 


Leu 


Leu 






85 










90 


Asp 


Pro 


Ser 


Gly Leu 


Ser 


Gly Asp 






100 










105 


Arg 


Pro 


Ala 


Asn 


Phe 


Lys 


Cys 


Asn 






115 










120 


Asp Val 


Val 


Met 


Gly 


Thr 


Leu 


Thr 






130 










135 


Ser 


Ala 


Ala 


Leu 


Arg 


Leu 


Ala 


Thr 






145 










150 


Asn 


Glu 


Arg 


He 


Asn 


Arg 


Val 


lie 






160 










165 


Val 


Ala 


Asp 


Ser 


Lys 


Val 


Gly Thr 






175 










180 


Gly Gly Glu 


Arg 


Lys 


Arg 


Thr 


Ser 






190 










195 


Asp 


Pro 


Ser 


He 


Leu 


Phe 


Leu 


Asp 






205 










210 


Ser 


Ser 


Thr 


Ala 


Asn 


Ala 


Val 


Leu 






220 










225 


Lys 


Gin Gly Arg Thr 


He 


He 


Phe 






235 










240 


Ser 


He 


Phe 


Lys 


Leu 


Phe 


Asp 


Ser 






250 










255 


Arg 


Leu 


Met 


Phe 


His 


Gly 


Pro 


Ala 






265 










270 


Glu 


Ser 


Ala 


Gly Tyr 


His 


Cys 


Glu 






280 










285 


Phe 


Phe 


Leu 


Asp 


He 


lie 


Asn 


Gly 






295 










300 
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Asp Ser 


Thr 


Ala 


Val 


Ala 


Leu 


Asn 


Arg 








305 








Thr Glu 


He 


He 


Glu 


Pro 


Ser 


Lys 


Gin 








320 








Lys Leu 


Ala 


Glu 


He 


Tyr 


Val 


Asn 


Ser 








335 










Lys Ala 


Glu 


Leu 


His 


Gin 


Leu 


Ser 


Gly 








350 








lie Thr 


Val 


Phe 


Lys 


Glu 


lie 


Ser 


Tyr 








365 








Gin Leu 


Arg 


Trp 


Val 


Ser 


Lys 


Arg 


Ser 








380 










Asn Pro 


Gin 


Ala 


Ser 


He 


Ala 


Gin 


He 








395 










Gly Leu 


Val 


He 


Gly 


Ala 


He 


Tyr 


Phe 








410 










Thr Gly 


He 


Gin 


Asn 


Arg 


Ala 


Gly 


Val 








425 










Asn Gin 


Cys 


Phe 


Ser 


Ser 


Val 


Ser 


Ala 








440 










Glu Lys 


Lys 


Leu 


Phe 


He 


His 


Glu 


Tyr 








455 








Val Ser 


Ser 


Tyr 


Phe 


Leu 


Gly 


Lys 


Leu 








470 










Met Arg 


Met 


Leu 


Pro 


Ser 


He 


He 


Phe 








485 










Met Leu 


Gly Leu 


Lys 


Pro 


Lys 


Ala 


Asp 








500 










Phe Thr 


Leu 


Met 


Met 


Val 


Ala 


Tyr 


Ser 








515 








Ala He 


Ala 


Ala 


Gly 


Gin 


Ser 


Val 


Val 








530 










Met Thr 


He 


Cys 


Phe 


Val 


Phe 


Met 


Met 








545 










Val Asn 


Leu 


Thr 


Thr 


He 


Ala 


Ser 


Trp 








560 








Phe Ser 


He 


Pro 


Arg 


Tyr 


Gly 


Phe 


Thr 








575 










Phe Leu 


Gly Gin 


Asn 


Phe 


Cys 


Pro 


Gly 








590 










Asn Pro 


Cys 


Asn 


Tyr 


Ala 


Thr 


Cys 


Thr 








605 










Lys Gin 


Gly 


He 


Asp 


Leu 


Ser 


Pro 


Trp 








620 










Val Ala 


Leu 


Ala 


Cys 


Met 


He 


Val 


He 








635 










Leu Lys 


Leu 


Leu 


Phe 


Leu 


Lys 


Lys 


Tyr 








650 










<210> 25 














<211> 184 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_feature 










<223> Incyte ID 


No: 


5593114CD1 




<400> 25 














Met Trp 


Val 


Phe 


Gly 


Tyr 


Gly 


Ser 


Leu 


1 






5 










Pro Tyr 


Gin 


Asp 


Lys 


Leu 


Val 


Gly 


Tyr 








20 










Arg Phe 


Trp 


Gin 


Gly 


Ser 


Thr 


Asp 


His 








35 










Pro Gly Arg 


Val 


Val 


Thr 


Leu 


Val 


Glu 








50 










Trp Gly Val 


Ala 


Tyr 


Arg 


Leu 


Pro 


Val 








65 












PCT/US00/16668 



Glu 


Glu 


Asp 


Phe 


Lys 


Ala 


310 










315 


Asp 


Lys 


Pro 


Leu 


He 


Glu 


325 










330 


Ser 


Phe 


Tyr 


Lys 


Glu 


Thr 


340 










345 


Gly 


Glu 


Lys 


Lys 


Lys 


Lys 


355 










360 


Thr 


Thr 


Ser 


Phe 


Cys 


His 


370 










375 


Phe 


Lys 


Asn 


Leu 


Leu 


Gly 


385 










390 


He 


Val 


Thr 


Val 


Val 


Leu 


400 










405 


Gly 


Leu 


Lys 


Asn 


Asp 


Ser 


415 










420 


Leu 


Phe 


Phe 


Leu 


Thr 


Thr 


430 










435 


Val 


Glu 


Leu 


Phe 


Val 


Val 


445 










450 


He 


Ser Gly 


Tyr 


Tyr 


Arg 


460 










465 


Leu 


Ser 


Asp 


Leu 


Leu 


Pro 


475 










480 


Thr 


Cys 


He 


Val 


Tyr 


Phe 


490 










495 


Ala 


Phe 


Phe 


Val 


Met 


Met 


505 










510 


Ala 


Ser 


Ser 


Met 


Ala 


Leu 


520 










525 


Ser 


Val 


Ala 


Thr 


Leu 


Leu 


535 










540 


He 


Phe 


Ser 


Gly Leu 


Leu 


550 










555 


Leu 


Ser 


Trp 


Leu 


Gin 


Tyr 


565 










570 


Ala 


Leu 


Gin 


His 


Asn 


Glu 


580 










585 


Leu 


Asn 


Ala 


Thr Gly 


Asn 


595 










600 


Gly 


Glu 


Glu 


Tyr 


Leu 


Val 


610 










615 


Gly 


Leu 


Trp 


Lys 


Asn 


His 


625 










630 


Phe 


Leu 


Thr 


He 


Ala 


Tyr 


640 










645 



Ser 
655 



He 


Trp 


Lys 


Val Asp 


Phe 


10 








15 


lie 


Thr 


Asn 


Tyr Ser 


Arg 


25 








30 


Arg 


Gly Val 


Pro Gly 


Lys 


40 








45 


Asp 


Pro 


Ala 


Gly Cys 


Val 


55 








60 


Gly 


Lys 


Glu 


Glu Glu 


Val 


70 








75 
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Lys 


Ala 


Tyr 


Leu 


Asp 


Phe 


Arg 


Glu 










80 






Thr 


Val 


He 


Phe 


Tyr 


Pro 


Lys 


Asp 










95 






va± 


Leu 


Leu 


Tyr 


He Gly Thr 


Cys 










110 






Pro 




Pro 


Leu 


Glu 
125 


Asp 


He 


Ala 


Gly 


Pro 


Ser 


Gly 


Arg 
140 


Asn 


Thr 


Glu 


Ser 


He 


Arg 


Asn 


Leu 
155 


Val 


Pro 


Glu 


Ala 


Leu 


Glu 


Lys 


Leu 


Val 


Lys 


Glu 










170 






Leu 


Asn 


Cys 


He 











85 90 
Thr Thr Lys Pro Phe Ser 
100 105 
Asn Pro Asp Tyr Leu Gly 
115 120 
Gin He Phe Asn Ala Ala 
130 135 
Leu Phe Glu Leu Ala Asn 
145 150 
Ala Asp Glu His Leu Phe 
160 165 
Leu Glu Gly Lys Gin Asn 
175 180 



<210> 26 
<211> 154 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 044775CD1 

<400> 26 



Met 


Gly Ala 


Phe 


Glu 


Cys 


Val 


Arg 


1 








5 






Leu 


Lys 


Gly 


Phe 


Tyr 


Arg 


Gly Met 










20 








Ser 


Glu 


Thr 


Val 


He 
35 


His 


Phe 


Val 


Lys 


Leu 


Leu 


Glu 


Tyr 
50 


Lys 


Thr 


Ala 


Glu 


Ser 


Val 


Lys 


Glu 


Ala 


Ser 


Asp 










65 






Ala 


Ala 


Thr 


Ser 


Lys 


Thr 


Cys 


Ala 










80 






Glu 


Val 


Val 


Arg 


Thr 


Arg 


Leu 


Arg 










95 






Ser 


Phe 


Phe 


Gin 


Thr 
110 


Leu 


Ser 


Leu 


Gly Ser 


Leu 


Tyr 


Arg Gly 


Leu 


Thr 










125 








Pro 


Asn 


Thr 


Ala 


He 
140 


Met 


Met 


Ala 


Leu 


Leu 


Asn 


Gly 











<210> 27 
<211> 438 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 116588CD1 



<400> 27 



Met 
1 


Leu 


Leu 


Val 


Thr 


Pro 


Arg 


Pro 


Glu 


Leu Gly Glu 


5 

Phe 


Cys 


Gly Thr 










20 








Phe 


Cys 


Tyr 


Asp 


Asn 


Pro 


Ala 


Ala 










35 








Asp 


Met 


Gin 


Val 


Asn 


Thr 


Thr 


Lys 










50 






Tyr 


Ser 


Trp 


Pro 


Asn 


Val 


Val 


Leu 










65 









Lys 


Val 


Tyr Gin 


Thr 


Asp 


Gly 




10 








15 


Ser 


Ala 


Ser Tyr 


Ala 


Gly 


He 




25 








30 


He 


Tyr 


Glu Ser 


He 


Lys 


Gin 




40 








45 


Ser 


Thr 


Met Glu 


Asn 


Asp 


Glu 




55 






60 


Phe 


Val 


Gly Met 


Met 


Leu 


Ala 




70 








75 


Thr 


Thr 


He Ala 


Tyr 


Pro 


His 




85 








90 


Glu Glu Gly Thr Lys Tyr Arg 




100 








105 


Leu 


Val 


Gin Glu Glu Gly Tyr 




115 








120 


Thr 


His 


Leu Val 


Arg 


Gin 


He 




130 








135 


Thr 


Tyr 


Glu Leu 


Val 


Val 


Tyr 




145 








150 



Glu 


Arg Gly Gly Arg Gly Thr 




10 








15 


Pro 


Leu 


Leu 


Phe Ser 


Ser 


Tyr 




25 








30 


Leu 


Gin 


Thr 


Gin Val 


Lys 


Arg 




40 








45 


Phe 


Met 


Leu 


Leu Tyr 


Ala 


Trp 




55 








60 


Cys 


Phe 


Phe 


Gly Gly 


Phe 


Leu 




70 








75 
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lie Asp Arg 


Val 


Phe 


Gly He 


Arg 


Trp 


Gly Thr 


He 


He 


Phe 


Ser 






80 








85 










90 


Cys Phe Val 


Cys 


lie 


Gly Gin 


Val 


Val 


Phe 


Ala 


Leu 


Gly 


Gly 


He 


Phe Asn Ala 




95 








100 






105 


Phe 


Trp 


Leu Met 


Glu 


Phe 


Gly Arg 


Phe 


Val 


Phe Gly 


lie Gly Gly 




110 








115 










120 


Glu 


Ser 


Leu Ala 


Val 


Ala 


Gin 


Asn 


Thr 


Tyr 


Ala 


Val 






125 








130 








135 


Ser Trp Phe 


Lys 


Gly 


Lys Glu 


Leu 


Asn 


Leu 


Val 


Phe Gly Leu Gin 






140 








145 










150 


Leu Ser Met 


Ala 


Arg 


He Gly 


Ser 


Thr 


Val 


Asn 


Met 


Asn 


Leu 


Met 


Gly Trp Leu 




155 








160 










165 


Tyr 


Ser 


Lys He 


Glu 


Ala 


Leu 


Leu 


Gly 


Ser 


Ala 


Gly 


His Thr Thr 




170 








175 








180 


Leu 


Gly 


He Thr 


Leu 


Met 


He 


Gly 


Gly Val 


Thr 


Cys 


lie Leu Ser 




185 








190 










195 


Leu 


He 


Cys Ala 


Leu 


Ala 


Leu 


Ala 




Leu 


Asp 


Gin 


Arg Ala Glu 




200 








205 






210 


Arg 


He 


Leu His 


Lys 


Glu 


Gin Gly 


Lys 


Thr Gly Glu 


Val lie Lys 




215 








220 










225 


Leu 


Thr 


Asp Val 


Lys 


Asp 


Phe 


Ser 


Leu 


Pro 


Leu 


Trp 






230 








235 










240 


Leu lie Phe 


He 


He 


Cys Val 


Cys 


Tyr 


Tyr 


Val 


hid 


Val 


Phe 


Pro 


Phe He Gly 




245 








250 










255 


Leu 


Gly 


Lys Val 


Phe 


Phe 


Thr 


Glu 


Lys 


Phe 


Gly 


Phe 






260 








265 






270 


Ser Ser Gin 


Ala 


Ala 


Ser Ala 


He 


Asn 


Ser 


val 


Val 


Tyr 


Val 


He 






275 








280 








285 


Ser Ala Pro 


Met 


Ser 


Pro Val 


Phe 


Gly 


Leu 


Leu 


Val 


Asp 


Lys 


Thr 


Gly Lys Asn 




290 








295 






300 


He 


lie 


Trp Val 


Leu 


Cys 


Ala 


Val 


Ala 


Ala 


Thr 


Leu 


Val Ser His 




305 


















315 


Met 


Met 


Leu Ala 


Phe 


Thr 


Met 


Trn 


Asn 


Pro 


Trp 


He 


Ala Met Cys 




320 








325 








330 


Leu 


Leu 


Gly Leu 


Ser 


Tyr 


Ser 


Leu 


Leu 


Ala 


Cys 


Ala 






335 








340 








345 


Leu Trp Pro 


Met 


Val 


Ala Phe 


Val 


Val 


Pro 


Glu 


His 


Gin Leu Gly 


Thr Ala Tyr 




350 








355 










360 


Gly 


Phe 


Met Gin 


Ser 


He 


Gin 


Asn 


Leu 


Gly 


Leu 


Ala 


i-Le lie ber 




365 








370 








375 


lie 


lie 


Ala Gly 


Met 


He 


Leu 


Asp 


Ser 


Arg Gly Tyr 






380 








385 










390 


Leu Phe Leu 


Glu 


Val 


Phe Phe 


He 


Ala 


Cys 


Val 


Ser 


Leu 


Ser 


Leu 


Leu Ser Val 




395 








400 










405 


Val 


Leu 


Leu Tyr 


Leu 


Val 


Asn 


Arg 


Ala 


Gin 


Gly Gly 






410 








415 










420 


Asn Leu Asn 


Tyr 


Ser 


Ala Arg 


Gin 


Arg 


Glu 


Glu 


He 


Lys 


Phe 


Ser 






425 








430 








435 


His Thr Glu 






















<210> 28 
























<211> 237 
























<212> PRT 
























<213> Homo sapiens 




















<220> 
























<221> misc_feature 




















<223> Incyte ID 


No: 


875369CD1 
















<400> 28 
























Met Ala His 


Val 


Gly 


Ser Arg 


Lys 


Arg 


Ser 


Arg 


Ser 


Arg 


Ser 


Arg 


1 




5 








10 










15 


Ser Arg Gly Arg Gly 


Ser Glu 


Lys 


Arg 


Lys 


Lys 


Lys 


Ser 


Arg 


Lys 


Asp Thr Ser 




20 








25 








30 


Arg 


Asn 


Cys Ser 


Ala 


Ser 


Thr 


Ser 


Gin Gly Arg 


Lys 


Ala Ser Thr 




35 








40 










45 


Ala 


Pro 


Gly Ala 


Glu 


Ala 


Ser 


Pro 


Ser 


Pro 


Cys 


He 






50 








55 








60 



24/60 




WO 00/78953 



Thr Glu 


Arg 


Ser 


Lys 


Gin 


Lys 


Ala 


Arg 








65 






Ser Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 








80 










Ser Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


Asp 








95 








Lys Tyr 


Lys 


Asp 


Lys 


Arg 


Arg 


Lys 


Lys 








110 










Leu Lys 


Lys 


Lys 


Gly 


Lys 


Glu 


Lys 


Ala 








125 










Ala Leu 


Pro 


Gly 


Pro 


Ser 


Leu 


Asp 


Gin 








140 








Glu Glu 


Glu 


Asp 


Gly 


Pro 


Val 


Leu 


Thr 








155 










lie Gin 


Ala 


Met 


Lys 


Pro 


Met 


Thr 


Lys 








170 








Gin Ser 


He 


He 


Arg 


Lys 


Val 


Val 


Asp 








185 










Arg Leu 


He 


Lys 


Gly 


Asp 


Gly 


Glu 


Val 








200 










Lys Glu 


Arg 


His 


Arg 


Glu 


He 


Asn 


Lys 








215 










Cys Leu 


Ala 


Phe 


Gin 


Met 


Arg 


Ala Gly 








230 










<210> 29 














<211> 219 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_feature 










<223> Incyte ID 


No: 


1325518CD1 




<400> 29 














Met Lys 


Leu 


Leu 


Leu 


Trp 


Ala 


Cys 


He 


1 






5 










Arg Lys 


Arg 


Arg 


Phe 


Pro 


Phe 


He 


Gly 








20 










Gly His 


Pro 


Leu 


His 


Pro 


Ser 


Leu 


Asn 








35 










Asn Leu 


Pro 


Pro 


Pro 


Leu 


Tyr 


Tyr 


Arg 








50 










Ser Tyr 


Pro 


Gly 


Asn 


Thr 


Tyr 


Thr 


Asp 








65 










Pro Trp 


He 


Leu 


Thr 


Ser 


Pro 


Gly 


Phe 








80 










Arg Gly 


Phe 


Pro 


Leu 


Ala 


Thr 


Gin 


Leu 








95 










Pro Arg 


Gly 


Phe 


Pro 


Phe 


Val 


Pro 


Pro 








110 










Ala Ala 


Ala 


Pro 


Ala 


Ala 


Pro 


Pro 


He 








125 










Ala Ala 


Pro 


Leu 


Thr 


Ala 


Thr 


Pro 


Val 








140 










Gly Ala 


Pro 


Val 


Ala 


Ala 


Glu 


Pro 


Ala 








155 










Ala Glu 


Pro 


Ala 


Ala 


Glu 


Ala 


Pro 


Val 








170 










Glu Ala 


Pro 


Val 


Gly 


Val 


Glu 


Pro 


Ala 








185 










Ala Glu 


Pro 


Ala 


Thr 


Ala 


Lys 


Pro 


Ala 








200 










Ser Pro 


Ser 


Leu 


Glu 


Gin 


Ala 


Asn 


Gin 








215 










<210> 30 














<211> 707 














<212> PRT 














<213> Homo i 


sapiens 












PCT/US00/16668 



Arg 


Arg 


Thr 


Arg 


Ser 


Ser 


70 










75 


Ser 


Ser 


Ser 


Ser 


Ser 


Ser 


85 










90 


Gly Arg 


Lys 


Lys 


Arg 


Gly 


100 










105 


Lys 


Lys 


Lys 


Arg 


Lys 


Lys 


115 










120 


Glu 


Ala 


Gin 


Gin 


Val 


Glu 


130 










135 


Trp 


His 


Arg 


Ser 


Ala 


Gly 


145 










150 


Asp 


Glu 


Gin 


Lys 


Ser 


Arg 


160 










165 


Glu 


Glu 


Trp 


Asp 


Ala 


Arg 


175 










180 


Pro 


Glu 


Thr 


Gly Arg 


Thr 


190 










195 


Leu 


Glu 


Glu 


He 


Val 


Thr 


205 










210 


Gin Ala Thr Arg Gly 


Asp 


220 










225 


Leu 


Leu 


Pro 








235 













Val 


Cys 


Val 


Ala 


Phe 


Ala 


10 










15 


Glu 


Asp 


Asp 


Asn 


Asp 


Asp 


25 










30 


He 


Pro 


Tyr 


Gly 


He 


Arg 


40 










45 


Pro 


Val 


Asn 


Thr 


Val 


Pro 


55 










60 


Thr Gly Leu 


Pro 


Ser 


Tyr 


70 










75 


Pro 


Tyr 


Val 


Tyr 


His 


He 


85 










90 


Asn 


Val 


Pro 


Pro 


Leu 


Pro 


100 










105 


Ser 


Arg 


Phe 


Phe 


Ser 


Ala 


115 










120 


Ala 


Ala 


Glu 


Pro 


Ala 


Ala 


130 










135 


Ala 


Ala 


Glu 


Pro 


Ala 


Ala 


145 










150 


Ala 


Glu 


Ala 


Pro 


Val 


Gly 


160 










165 


Ala 


Ala 


Glu 


Pro 


Ala 


Ala 


175 










180 


Ala 


Glu 


Glu 


Pro 


Ser 


Pro 


190 










195 


Ala 


Pro 


Glu 


Pro 


His 


Pro 


205 










210 



25/60 




WO 00/78953 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2060987CD1 

<400> 30 



Met Ala Ala 
1 


Ala 


Ala 


Thr 


Ala 


Ala 


Glu 


Pro Pro Gly Glu Val 


He 


His 


Leu 


Asn 






20 










Ser Thr Ser 


Arg 


Gin 


Thr 


Leu 


Thr 


Trp 






35 








Ser Ser Leu 


Leu 


Ser 


Gly Arg 


He 


Ser 






50 










Gly Ala lie 


Phe 


He 


Asp 


Arg 


Asp 


Pro 






65 








Leu Asn Phe 


Leu 


Arg 


Thr 


Lys 


Glu 


Leu 






80 










Gly Ser Ser 


Leu 


Leu 


His 


Glu 


Ala 


Gin 






95 










Leu Val Arg 


Arg 


Leu 


Gin 


Leu 


Arg 


Glu 






110 








Cys Gly Asn Val 


Leu 


Phe 


Asn 


Gly 


Tyr 






125 






Pro Val Lys 


Arg 


Arg 


Asn 


Arg 


His 


Ser 






140 










Leu Gly Gly Arg 


Pro 


Ala 


Pro 


Val 


Arg 






155 








Pro Asn Leu Gly Asn 


Ala Gly 


Leu 


Leu 






170 










Lys Thr Pro 


Pro 


Ser 


Pro 


Ser 


Gly 


Gin 






185 








Val Arg Leu 


Val 


Cys 


Gly His 


His 


Asn 






200 










Thr Gin Phe 


Leu 


Val 


Cys 


Tyr 


Arg 


Leu 






215 










Gin Leu Val 


Phe 


Ser 


Ser 


Pro 


Arg 


Leu 






230 








Leu Ala Leu 


Thr 


Ala 


Arg 


Val 


His 


Gly 






245 








Asp Lys Met 


Val 


Ala 


Ala 


Ala 


Thr 


Gly 






260 








Ala Leu Gin 


Ala 


Glu 


Gly Gly 


Gly 


Ser 






275 








Leu Gly Val 


Pro 


Val 


Glu 


Ala 


Leu 


Phe 






290 










lie Ala Thr 


Ser 


His 


Thr 


Gly 


Arg 


He 






305 










Thr Lys His 


Trp 


Gin 


Val 


Gin 


Glu 


Val 






320 










Asp Ala Ala 


Gly 


Ser 


Phe 


Leu 


Leu 


Leu 






335 










lie Tyr Tyr 


Val 


Asp 


Val 


Gin 


Lys 


Phe 






350 








Asn Asp Leu 


Leu 


Val 


Ser 


Glu 


Leu 


Tyr 






365 








Gly Val Thr 


Ala 


Leu 


Ser 


Val 


Tyr 


Leu 






380 








Ser Gly Asn Trp lie 


Glu 


He 


Ala 


Tyr 






395 








Val Arg Val 


He 


Val 


Gin 


His 


Pro 


Glu 






410 










Gin Leu Phe 


Gin 


Thr 


Phe 


Thr 


Val 


His 






425 










He Met Leu 


Ser 


Glu 


Lys 


His 


Leu 


He 






440 










Asn His Val 


Arg 


Thr 


Trp 


Ser 


Val 


Thr 






455 










Ser Thr Gin 


Pro Gly 


Ser 


Thr 


Pro 


Leu 
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Gly Val 


Pro 


Ser 


Arg 


Gly 


10 










15 


Val 


Gly Gly Lys 


Arg 


Phe 


25 










30 


He 


Pro 


Asp 


Ser 


Phe 


Phe 


40 










45 


Thr 


Leu 


Lys 


Asp 


Glu 


Thr 


55 










60 


Thr 


Val 


Phe 


Ala 


Pro 


He 


70 










75 


Asp 


Pro 


Arg 


Gly Val 


His 


85 










90 


Phe Tyr Gly Leu 


Thr 


Pro 


100 










105 


Glu 


Leu 


Asp 


Arg 


Ser 


Ser 


115 










120 


Leu 


Pro 


Pro 


Pro 


Val 


Phe 


130 










135 


Leu 


Val 


Gly 


Pro 


Gin 


Gin 


145 










150 


Arg 


Ser 


Asn 


Thr 


Met 


Pro 


160 










165 


Gly Arg Met 


Leu 


Asp 


Glu 


175 










180 


Pro 


Glu 


Glu 


Pro 


Gly 


Met 


190 










195 


Trp 


He 


Ala 


Val 


Ala 


Tyr 


205 










210 


Lys 


Glu 


Ala 


Ser 


Gly 


Trp 


220 










225 


Asp 


Trp 


Pro 


lie 


Glu 


Arg 


235 










240 


Gly Ala 


Leu Gly 


Glu 


His 


250 










255 


Ser 


Glu 


He 


Leu 


Leu 


Trp 


265 










270 


Glu 


He 


Gly Val 


Phe 


His 


280 










285 


Phe 


Val 


Gly Asn Gin 


Leu 


295 










300 


Gly Val 


Trp 


Asn 


Ala 


Val 


310 










315 


Gin 


Pro 


He 


Thr 


Ser 


Tyr 


325 










330 


Gly Cys Asn Asn Gly 


Ser 


340 










345 


Pro 


Leu 


Arg 


Met 


Lys 


Asp 


355 










360 


Arg 


Asp 


Pro 


Ala 


Glu 


Asp 


370 










375 


Thr 


Pro 


Lys 


Thr 


Ser 


Asp 


385 










390 


Gly Thr 


Ser 


Ser 


Gly 


Gly 


400 










405 


Thr 


Val 


Gly 


Ser 


Gly 


Pro 


415 










420 


Arg 


Ser 


Pro 


Val 


Thr 


Lys 


430 










435 


Ser 


Val 


Cys 


Ala 


Asp 


Asn 


445 










450 


Arg 


Phe 


Arg Gly Met 


lie 


460 










465 


Ala 


Ser 


Phe 


Lys 


He 


Leu 
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470 










Ala Leu 


Glu 


Ser 


Ala 


Asp Gly 


His 


Gly 








485 










Asp lie 


Gly 


Pro 


Tyr Gly Glu 


Arg 


Asp 








500 










Gin Lys 


Val 


Val 


Pro 


Ser 


Ala 


Ser 


Gin 








515 










Ser Thr 


Gly Gin 


Arg 


Val 


Cys 


Ser 


Val 








530 










Pro Thr 


Thr 


Ala 


Phe 


Thr 


Val 


Leu 


Glu 








545 










Leu Gly 


Ser 


Arg 


Pro 


Arg 


Arg 


Tyr 


Leu 








560 










Gly Ser 


Leu 


Ala 


Met 


Trp 


Asp 


Leu 


Thr 








575 










Gly Gin Ala 


Pro 


Ala Gly Gly 


Leu 


Thr 








590 










Gin Leu 


Glu 


His 


Cys 


Glu 


Leu 


Ala 


Pro 








605 










Ser Trp Gly 


Cys 


Leu 


Pro 


Ser 


Pro 


Ser 








620 










Ser Leu 


His 


Ser 


Ala 


Ser 


Ser 


Asn 


Thr 








635 










Gly Ser 


Pro 


Ser 


Pro 


Pro 


Gin 


Ala 


Glu 








650 










Gly Ser 


Phe 


Val 


Glu 


Arg 


Cys 


Gin 


Glu 








665 










Asp Leu 


Arg 


Arg 


Pro 


Pro 


Thr 


Pro 


Ala 








680 










Leu Gly Thr 


Pro 


Leu 


Thr 


Pro 


Pro 


Lys 








695 








Ser Phe 
















<210> 31 














<211> 279 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_f eature 










<223> Incyte ID 


No: 


2172064CD1 




<400> 31 














Met Cys 


Gly 


Arg 


Phe 


Leu 


Arg 


Arg 


Leu 


1 






5 










Arg Ser 


Thr 


Pro 


Val 


Gly 


Arg 


Leu 


Leu 








20 










Phe Arg 


Leu 


Val 


Leu 


Leu 


Ala 


Ala 


Ser 








35 










Asp Glu 


Gin 


Ser 


Glu 


Phe 


Val 


Cys 


His 








50 










Lys Ala 


Ala 


Cys 


Phe 


Asp 


Ala 


Phe 


His 








65 










Ser Trp 


Val 


Phe 


Gin 


Val 


He 


Leu 


Val 








80 










Tyr Met 


Gly 


Phe 


Thr 


Leu 


Tyr 


His 


Val 








95 










Ser Gly 


Lys 


Gly 


Lys 


Glu 


Glu 


Glu 


Thr 








110 










Gly Asn 


Thr 


Asp 


Val 


Pro 


Gly 


Ala Gly 








125 










Ala Tyr 


Val 


Ala 


Gin 


Leu 


Gly 


Ala 


Arg 








140 










Ala Leu 


Gly 


Leu 


Gin 


Tyr 


His 


Leu 


Tyr 








155 










Ser Phe 


Ala 


Cys 


Arg 


Arg 


Glu 


Pro 


Cys 








170 










Asn Leu 


Ser 


Arg 


Pro 


Ser 


Glu 


Lys 


Thr 
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475 








480 


Gly 


Cys 


Ser 


Ala 


Gly Asn 


490 








495 


Asp 


Gin 


Gin 


Val 


Phe He 


505 








510 


Leu 


Phe 


Val 


Arg 


Leu Ser 


520 








525 


Arg 


Ser 


Val 


Asp Gly Ser 


535 








540 


Cys 


Glu 


Gly 


Ser 


Arg Arg 


550 








555 


Leu 


Thr 


Gly Gin 


Ala Asn 


565 








570 


Thr 


Ala 


Met 


Asp 


Gly Leu 


580 








585 


Glu 


Gin 


Glu 


Leu 


Met Glu 


595 








600 


Pro 


Ala 


Pro 


Ser 


Ala Pro 


610 








615 


Pro 


Arg 


He 


Ser 


Leu Thr 


625 








630 


Ser 


Leu 


Ser 


Gly 


His Arg 


640 








645 


Ala 


Arg 


Arg 


Arg 


Gly Gly 


655 








660 


Leu 


Val 


Arg 


Ser 


Gly Pro 


670 








675 


Pro 


Trp 


Pro 


Ser 


Ser Gly 


685 








690 


Met 


Lys 


Leu 


Asn 


Glu Thr 


700 








705 



Leu 


Ala 


Glu 


Glu 


Ser 


Arg 


10 










15 


Leu 


Pro 


Val 


Leu 


Leu 


Gly 


25 










30 


Gly 


Pro 


Gly Val 


Tyr 


Gly 


40 










45 


Thr 


Gin 


Gin 


Pro 


Gly Cys 


55 










60 


Pro 


Leu 


Ser 


Pro 


Leu 


Arg 


70 










75 


Ala 


Val 


Pro 


Ser 


Ala 


Leu 


85 










90 


lie 


Trp 


His 


Trp 


Glu 


Leu 


100 










105 


Leu 


He 


Gin 


Gly 


Arg 


Glu 


115 










120 


Ser 


Leu 


Arg 


Leu 


Leu 


Trp 


130 










135 


Leu 


Val 


Leu 


Glu 


Gly Ala 


145 










150 


Gly 


Phe 


Gin 


Met 


Pro 


Ser 


160 










165 


Leu 


Gly 


Ser 


lie 


Thr 


Cys 


175 










180 


He 


Phe 


Leu 


Lys 


Thr 


Met 
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185 










190 










195 


Phe Gly Val 


Ser 


Gly 


Phe 


Cys 


Leu 


Leu 


Phe 


Thr 


Phe 


Leu 


Glu 


Leu 






200 










205 










210 


Val Leu Leu 


Gly 


Leu 


Gly Arg 


Trp 


Trp 


Arg 


Thr 


Trp 


Lys 


His 


Lys 






215 










220 










225 


Ser Ser Ser 


Ser 


Lys 


Tyr 


Phe 


Leu 


Thr 


Ser 


Glu 


Ser 


Thr 


Arg 


Arg 






230 










235 










240 


His Lys Lys 


Ala 


Thr 


Asp 


Ser 


Leu 


Pro 


Val 


Val 


Glu 


Thr 


Lys 


Glu 






245 










250 










255 


Gin Phe Gin 


Glu 


Ala 


Val 


Pro 


Gly Arg 


Ser 


Leu 


Ala 


Gin 


Glu 


Lys 






260 










265 










270 


Gin Arg Pro 


Val 


Gly 


Pro 


Arg 


Asp 


Ala 


















275 






















<210> 32 


























<211> 154 


























<212> PRT 


























<213> Homo sapiens 






















<220> 


























<221> misc_f eature 






















<223> Incyte ID 


No: 


2219267CD1 
















<400> 32 


























Met Val Thr 


Gly 


Leu 


Ala 


Ser 


Leu 


Leu 


Leu 


Leu 


Ala 


Gly Ala 


Gin 


1 




5 










10 










15 


Tyr Leu Pro 


Gly 


Trp 


Thr 


Val 


Leu 


Phe 


Leu 


Ser 


Val 


Leu 


Gly 


Leu 






20 










25 










30 


Leu Ala Ser 


Arg 


Ala 


Val 


Ser 


Ala 


Leu 


Ser 


Ser 


Leu 


Phe 


Ala 


Ala 






35 










40 










45 


Glu Val Phe 


Pro 


Thr 


Val 


He 


Arg 


Gly Ala 


Gly Leu 


Gly 


Leu 


Val 






50 










55 










60 


Leu Gly Ala 


Gly 


Phe 


Leu 


Gly 


Gin 


Ala 


Ala 


Gly 


Pro 


Leu 


Asp 


Thr 






65 










70 










75 


Leu His Gly 


Arg 


Gin 


Gly 


Phe 


Phe 


Leu 


Gin 


Gin 


Val 


Val 


Phe 


Ala 






80 










85 










90 


Ser Leu Ala 


Val 


Leu 


Ala 


Leu 


Leu 


Cys 


Val 


Leu 


Leu 


Leu 


Pro 


Glu 






95 










100 










105 


Ser Arg Ser 


Arg 


Gly 


Leu 


Pro 


Gin 


Ser 


Leu 


Gin 


Asp 


Ala 


Asp 


Arg 






110 










115 










120 


Leu Arg Arg 


Ser 


Pro 


Leu 


Leu 


Arg 


Gly Arg 


Pro 


Arg 


Gin 


Asp 


His 






125 










130 










135 


Leu Pro Leu 


Leu 


Pro 


Pro 


Ser 


Asn 


Ser 


Tyr 


Trp 


Ala 


Gly 


His 


Thr 






140 










145 










150 


Pro Glu Gin 


His 
























<210> 33 


























<211> 289 


























<212> PRT 


























<213> Homo sapiens 






















<220> 


























<221> misc_f eature 






















<223> Incyte ID 


No: 


2308629CD1 
















<400> 33 


























Met Val Ala 


Gly 


Ala 


Val 


Ala 


Gly 


He 


Leu 


Glu 


His 


Cys 


Val 


Met 


1 




5 










10 










15 


Tyr Pro lie 


Asp 


Cys 


Val 


Lys 


Thr 


Arg 


Met 


Gin 


Ser 


Leu 


Gin 


Pro 






20 










25 










30 


Asp Pro Ala 


Ala 


Arg 


Tyr 


Arg 


Asn 


Val 


Leu 


Glu 


Ala 


Leu 


Trp 


Arg 






35 










40 










45 


lie lie Arg 


Thr 


Glu 


Gly 


Leu 


Trp 


Arg 


Pro 


Met 


Arg Gly 


Leu 


Asn 






50 










55 










60 


Val Thr Ala 


Thr 


Gly 


Ala 


Gly 


Pro 


Ala 


His 


Ala 


Leu 


Tyr 


Phe 


Ala 






65 










70 










75 


Cys Tyr Glu 


Lys 


Leu 


Lys 


Lys 


Thr 


Leu 


Ser 


Asp 


Val 


He 


His 


Pro 






80 










85 










90 


Gly Gly Asn 


Ser 


His 


He 


Ala 


Asn 


Gly Ala 


Ala Gly Cys 


Val 


Ala 
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95 










100 








105 


Thr 


Leu 


Leu 


His 


Asp 


Ala 


Ala 


Met 


Asn 


Pro 


Ala 


Glu Val 


Val 


Lys 










110 










115 








120 


Gin 


Arg 


Met 


Gin 


Met 


Tyr 


Asn 


Ser 


Pro 


Tyr 


His 


Arg Val 


Thr 


Asp 










125 










130 








135 


Cys 


Val 


Arg 


Ala 


Val 


Trp 


Gin 


Asn 


Glu 


Gly Ala Gly Ala 


Phe 


Tyr 










140 










145 








150 


Arg 


Ser 


Tyr 


Thr 


Thr 


Gin 


Leu 


Thr 


Met 


Asn 


Val 


Pro Phe 


Gin 


Ala 










155 










160 








165 


He 


His 


Phe 


Met 


Thr 


Tyr 


Glu 


Phe 


Leu 


Gin 


Glu 


His Phe 


Asn 


Pro 










170 










175 








180 


Gin 


Arg 


Arg 


Tyr 


Asn 


Pro 


Ser 


Ser 


His 


Val 


Leu 


Ser Gly 


Ala 


Cys 










185 










190 








195 


Ala 


Gly 


Ala 


Val 


Ala 


Ala 


Ala 


Ala 


Thr 


Thr 


Pro 


Leu Asp 


Val 


Cys 










200 










205 








210 


Lys 


Thr 


Leu 


Leu 


Asn 


Thr 


Gin 


Glu 


Ser 


Leu 


Ala 


Leu Asn 


Ser 


His 








215 










220 








225 


He 


Thr 


Gly His 


He 


Thr Gly 


Met 


Ala 


Ser 


Ala 


Phe Arg 


Thr 


Val 










230 










235 








240 


Tyr 


Gin 


Val 


Gly 


Gly 


Val 


Thr 


Ala 


Tyr 


Phe Arg Gly Val 


Gin 


Ala 










245 










250 








255 


Arg 


Val 


He 


Tyr 


Gin 


He 


Pro 


Ser 


Thr 


Ala 


He 


Ala Trp 


Ser 


Val 










260 










265 








270 


Tyr 


Glu 


Phe 


Phe 


Lys 


Tyr 


Leu 


He 


Thr 


Lys 


Arg 


Gin Glu 


Glu 


Trp 










275 










280 








285 


Arg 


Ala 


Gly Lys 























<210> 34 
<211> 300 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2660038CD1 

<400> 34 

Met Asp Phe Leu Met Ser Gly Leu Ala Ala Cys Gly Ala Cys Val 
15 10 15 

Phe Thr Asn Pro Leu Glu Val Val Lys Thr Arg Met Gin Leu Gin 
20 25 30 

Gly Glu Leu Gin Ala Pro Gly Thr Tyr Gin Arg His Tyr Arg Asn 
35 40 45 

Val Phe His Ala Phe He Thr He Gly Lys Val Asp Gly Leu Ala 
50 55 60 

Ala Leu Gin Lys Gly Leu Ala Pro Ala Leu Leu Tyr Gin Phe Leu 
65 70 75 

Met Asn Gly He Arg Leu Gly Thr Tyr Gly Leu Ala Glu Ala Gly 
80 85 90 

Gly Tyr Leu His Thr Ala Glu Ala Thr His Ser Pro Ala Arg Ser 
95 100 105 

Ala Ala Ala Gly Ala Met Ala Gly Val Met Gly Ala Tyr Leu Gly 
110 115 120 

Ser Pro He Tyr Met Val Lys Thr His Leu Gin Ala Gin Ala Ala 
125 130 135 

Ser Glu He Ala Val Gly His Gin Tyr Lys His Gin Gly Met Phe 
140 145 150 

Gin Ala Leu Thr Glu He Gly Gin Lys His Gly Leu Val Gly Leu 
155 160 165 

Trp Arg Gly Ala Leu Gly Gly Leu Pro Arg Val He Val Gly Ser 
170 175 180 

Ser Thr Gin Leu Cys Thr Phe Ser Ser Thr Lys Asp Leu Leu Ser 
185 190 195 

Gin Trp Glu He Phe Pro Pro Gin Ser Trp Lys Leu Ala Leu Val 
200 205 210 

Ala Ala Met Met Ser Gly He Ala Val Val Leu Ala Met Ala Pro 
215 220 225 

Phe Asp Val Ala Cys Thr Arg Leu Tyr Asn Gin Pro Thr Asp Ala 
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230 






235 








240 


Gin Gly 


Lys 


Gly 


Leu 


Met 


Tyr Arg Gly 


lie Leu 


Asp 


Ala 


Leu 


Leu 








245 






250 








255 


Gin Thr 


Ala 


Arg 


Thr 


Glu 


Gly lie Phe 


Gly Met 


Tyr 


Lys 


Gly 


He 








260 




265 








270 


Gly Ala 


Ser 


Tyr 


Phe 


Arg 


Leu Gly Pro 


His Thr 


He 


Leu 


Ser 


Leu 








275 






280 








285 


Phe Phe 


Trp 


Asp 


Gin 


Leu 


Arg Ser Leu 


Tyr Tyr 


Thr 


Asp 


Thr 


Lys 








290 




295 








300 



<210> 35 

<211> 382 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2670745CD1 



<400> 35 



Met 


Leu 


Arg 


Trp 


Thr Val 


His 


Leu Glu Gly Gly 


Pro 


Arg 


Arg 


Val 


1 








5 








10 










15 


Asn 


His 


Ala 


Ala 


Val Ala 


Val 


Gly His Arg Val 


Tyr 


Ser 


Phe Gly 










20 








25 










30 


Gly Tyr Cys 


Ser 


Gly Glu 


Asp 


Tyr 


Glu 


Thr 


Leu 


Arg 


Gin 


He 


Asp 










35 








40 










45 


Val 


His 


He 


Phe 


Asn Ala 
50 


Val 


Ser 


Leu 


Arg 
55 


Trp 


Thr 


Lys 


Leu 


Pro 
60 


Pro 


Val 


Lys 


Ser 


Ala He 


Arg 


Gly Gin Ala 


Pro 


Val 


Val 


Pro 


Tyr 










65 








70 










75 


Met 


Arg 


Tyr Gly His Ser 


Thr 


Val 


Leu 


He 


Asp 


Asp 


Thr 


Val 


Leu 










80 








85 










90 


Leu 


Trp Gly Gly Arg Asn Asp 


Thr 


Glu 


Gly 


Ala 


Cys 


Asn 


Val 


Leu 










95 








100 










105 


Tyr 


Ala 


Phe 


Asp 


Val Asn 
110 


Thr 


His 


Lys 


Trp 
115 


Phe 


Thr 


Pro 


Arg 


Val 
120 


Ser 


Gly 


Thr 


Val 


Pro Gly Ala 


Arg 


Asp 


Gly 


His 


Ser 


Ala 


Cys 


Val 










125 








130 










135 


Leu 


Gly Lys 


He 


Met Tyr 


He 


Phe 


Gly Gly 


Tyr 


Glu 


Gin 


Gin 


Ala 










140 








145 










150 


Asp 


Cys 


Phe 


Ser 


Asn Asp 
155 


He 


His 


Lys 


Leu 
160 


Asp 


Thr 


Ser 


Thr 


Met 
165 


Thr 


Trp 


Thr 


Leu 


He Cys 
170 


Thr 


Lys 


Gly 


Ser 
175 


Pro 


Ala 


Arg 


Trp 


Arg 
180 


Asp 


Phe 


His 


Ser 


Ala Thr 
185 


Met 


Leu 


Gly 


Ser 
190 


His 


Met 


Tyr 


Val 


Phe 
195 


Gly Gly Arg Ala 


Asp Arg 


Phe 


Gly 


Pro 


Phe 


His 


Ser 


Asn 


Asn 


Glu 










200 








205 










210 


He 


Tyr 


Cys 


Asn 


Arg lie 
215 


Arg 


Val 


Phe 


Asp 
220 


Thr 


Arg 


Thr 


Glu 


Ala 
225 


Trp 


Leu 


Asp 


Cys 


Pro Pro 


Thr 


Pro 


Val 


Leu 


Pro 


Glu 


Gly Arg Arg 










230 








235 










240 


Ser 


His 


Ser 


Ala 


Phe Gly Tyr 


Asn Gly Glu 


Leu 


Tyr 


He 


Phe 


Gly 










245 








250 










255 


Gly Tyr Asn Ala Arg Leu Asn Arg His 


Phe 


His 


Asp 


Leu 


Trp 


Lys 










260 








265 










270 


Phe 


Asn 


Pro 


Val 


Ser Phe 
275 


Thr 


Trp 


Lys 


Lys 
280 


He 


Glu 


Pro 


Lys 


Gly 
285 


Lys 


Gly 


Pro 


Cys 


Pro Arg 
290 


Arg 


Arg 


Gin 


Cys 
295 


Cys 


Cys 


He 


Val 


Gly 
300 


Asp 


Lys 


He 


Val 


Leu Phe 


Gly Gly Thr 


Ser 


Pro 


Ser 


Pro 


Glu 


Glu 










305 








310 










315 


Gly 


Leu 


Gly Asp 


Glu Phe 


Asp 


Leu 


He 


Asp 


His 


Ser 


Asp 


Leu 


His 










320 








325 










330 


He 


Leu 


Asp 


Phe 


Ser Pro 
335 


Ser 


Leu 


Lys 


Thr 
340 


Leu 


Cys 


Lys 


Leu 


Ala 
345 


Val 


He 


Gin 


Tyr 


Asn Leu 
350 


Asp 


Gin 


Ser 


Cys 
355 


Leu 


Pro 


His 


Asp 


He 
360 
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Arg Trp Glu Leu Asn Ala Met Thr Thr Asn Ser Asn lie Ser Arg 
365 370 375 

Pro lie Val Ser Ser His Gly 
380 

<210> 36 
<211> 287 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2676443CD1 



<400> 36 



Met Ala 


Ala 


Glu 


Ala 


Arg 


Val 


Ser 


Arg 


Trp 


Tyr 


Phe Gly Gly Leu 


1 






5 










10 










15 


Ala Ser 


Cys 


Gly Ala 


Ala 


Cys 


Cys 


Thr 


His 


Pro 


Leu 


Asp 


Leu 


Leu 








20 








25 










30 


Lys Val 


His 


Leu 


Gin 


Thr 


Gin 


Gin 


Glu 


Val 


Lys 


Leu 


Arg 


Met 


Thr 








35 










40 










45 


Gly Met Ala 


Leu 


Arg 


Val 


Val 


Arg 


Thr 


Asp 


Gly 


He 


Leu 


Ala 


Leu 








50 










55 










60 


Tyr Ser 


Gly 


Leu 


Ser 


Ala 


Ser 


Leu 


Cys 


Arg 


Gin 


Met 


Thr 


Tyr 


Ser 








65 










70 










75 


Leu Thr 


Arg 


Phe 


Ala 


He 


Tyr 


Glu 


Thr 


Val 


Arg 


Asp 


Arg 


Val 


Ala 






80 








85 










90 


Lys Gly 


Ser Gin Gly 


Pro 


Leu 


Pro 


Phe 


His 


Glu 


Lys 


Val 


Leu 


Leu 








95 










100 










105 


Gly Ser 


Val 


Ser 


Gly 


Leu 


Ala 


Gly Gly 


Phe 


Val 


Gly 


Thr 


Pro 


Ala 








110 










115 










120 


Asp Leu 


Val 


Asn 


Val 


Arg 


Met 


Gin 


Asn 


Asp 


Val 


Lys 


Leu 


Pro 


Gin 






125 








130 










135 


Gly Gin Arg Arg 


Asn 


Tyr 


Ala 


His 


Ala 


Leu Asp Gly 


Leu 


Tyr 


Arg 








140 










145 










150 


Val Ala 


Arg 


Glu 


Glu 


Gly Leu Arg 


Arg 


Leu 


Phe 


Ser 


Gly 


Ala 


Thr 








155 










160 










165 


Met Ala 


Ser 


Ser 


Arg 


Gly Ala 


Leu 


Val 


Thr 


Val 


Gly Gin 


Leu 


Ser 








170 










175 










180 


Cys Tyr 


Asp 


Gin 


Ala 


Lys 


Gin 


Leu 


Val 


Leu 


Ser 


Thr Gly Tyr 


Leu 








185 










190 










195 


Ser Asp 


Asn 


He 


Phe 


Thr 


His 


Phe 


Val 


Ala 


Ser 


Phe 


He 


Ala 


Gly 






200 










205 










210 


Gly Cys 


Ala 


Thr 


Phe 


Leu 


Cys 


Gin 


Pro 


Leu 


Asp 


Val 


Leu 


Lys 


Thr 








215 










220 










225 


Arg Leu 


Met 


Asn 


Ser 


Lys 


Gly 


Glu 


Tyr 


Gin 


Gly Val 


Phe 


His 


Cys 








230 










235 










240 


Ala Val 


Glu 


Thr 


Ala 


Lys 


Leu 


Gly 


Pro 


Leu 


Ala 


Phe 


Tyr 


Lys 


Gly 








245 






250 










255 


Leu Val 


Pro 


Ala 


Gly 


He 


Arg 


Leu 


He 


Pro 


His 


Thr 


Val 


Leu 


Thr 








260 










265 










270 


Phe Val 


Phe 


Leu 


Glu 


Gin 


Leu 


Arg 


Lys 


Asn 


Phe Gly 


He 


Lys 


Val 








275 










280 










285 



Pro Ser 



<210> 37 
<211> 497 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 3295764CD1 



<400> 37 

Met Asp Val Pro Gly Pro Val Ser Arg Arg Ala Ala Ala Ala Ala 
15 10 15 

Ala Thr Val Leu Leu Arg Thr Ala Arg Val Arg Arg Glu Cys Trp 
20 25 30 
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Phe 


Leu 


Pro 


Thr 


Ala 


Leu 


Leu 


Cys 


Ala 










35 








Leu 


Arg 


Pro 


Ser 


Glu 


Pro 


Phe 


Leu 


Thr 










50 










Asp 


Lys 


Asn 


Leu 


Thr 


Glu Arg 


Glu 


Val 










65 










Val 


Trp 


Thr 


Tyr 


Ser 


Tyr 


Leu 


Val 


Leu 










80 










Ala 


Thr 


Asp 


Tyr 


Leu 


Arg 


Tyr 


Lys 


Pro 










95 










Leu 


Ser 


Leu 


He 


Val 


Thr 


Trp 


Phe 


Met 










110 








Leu 


Leu 


Ala 


He 


Gin 


Phe 


Leu 


Glu 


Phe 










125 










Ala 


Thr 


Glu 


He 


Ala 


Tyr Tyr 


Ser 


Tyr 










140 










Leu 


Gly 


Met 


Tyr 


Gin 


Lys 


Val 


Thr 


Ser 










155 










Leu 


Val 


Gly 


Phe 


Thr 


Val 


Gly 


Ser 


Val 










170 










Ser 


Val 


Ala 


Gly Trp 


Ser 


Leu 


Phe 


Ser 










185 










Thr 


Cys 


Val 


Ser 


Val 


Ala 


Phe 


Ala 


Val 










200 










Pro 


Gin 


Lys 


Ser 


Leu 


Phe 


Phe 


His 


His 










215 










Arg 


Val 


Asn 


Gly 


He 


Lvs 


Val 


Gin 


Asn 










230 










Thr 


Pro 


Ala 


Ser 


Asn 


His 


Leu 


Pro Gly 










245 










Lys 


He 


Pro 


Leu 


Asn 


Met 


Glu 


Glu 


Pro 










260 










Pro 


Lys 


Pro 


Asp 


Arg 


Leu 


Leu 


Val 


Leu 










275 










Phe 


Leu 


Met 


Cys 


Tyr 


Ser 


Ser 


Arg 


Pro 










290 










Trp 


Trp 


Ala 


Leu 


Ser 


Thr 


Cys 


Gly Tyr 










305 










Thr 


Gin 


Gly 


Leu 


Trp 


Glu 


Lys 


Val 


Met 










320 










He 


Tyr 


Asn 


Gly Gly 


Val 


Glu 


Ala 


Val 










335 










Val 


Ala 


Val 


Phe 


Ala 


Val 


Gly 


Tyr 


He 










350 










Trp 


Gly 


Glu 


Met 


Thr 


Leu 


Ser 


Leu 


Phe 










365 










Ala 


Val 


Tyr 


He 


Met 


Asp 


Thr 


Val 


Gly 










380 










Ala 


Ser 


Tyr 


Val 


Val 


Phe 


Arg 


He 


He 










395 










He 


Ala 


Thr 


Phe 


Gin 


He 


Ala 


Ala 


Asn 










410 










Ala 


Leu 


Val 


Phe 


Gly 


Val 


Asn 


Thr 


Phe 










425 










Thr 


Leu 


Leu 


Thr 


Leu 


He 


Val 


Val 


Asp 










440 










Glu 


He 


Thr 


Thr 


Gin 


Phe 


Leu 


He 


Tyr 










455 








He 


Ala 


Val 


Val 


Phe 


Leu 


Ala 


Ser Gly 










470 










Lys 


Cys 


Arg 


Lys 


Leu 


Glu 


Asp 


Pro 


Gin 










485 










Thr 


Ser 


















PCT/USOO/16668 



Tyr 


Gly 


Phe 


Phe 


Ala 


Ser 


40 










45 


Pro 


Tyr 


Leu 


Leu 


Gly 


Pro 


55 










60 


Phe 


Asn 


Glu 


lie 


Tyr 


Pro 


70 










75 


Leu 


Phe 


Pro 


Val 


Phe 


Leu 


85 










90 


Val 


Val 


Leu 


Leu 


Gin 


Gly 


100 










105 


Leu 


Leu 


Tyr 


Ala 


Gin 


Gly 


115 










120 


Phe 


Tyr Gly lie 


Ala 


Thr 


130 










135 


lie 


Tyr 


Ser 


Val 


Val 


Asp 


145 










150 


Tyr 


Cys 


Arg 


Ser 


Ala 


Thr 


160 










165 


Leu 


Gly Gin 


lie 


Leu 


Val 


175 










180 


Leu 


Asn 


Val 


lie 


Ser 


Leu 


190 










195 


Ala 


Trp 


Phe 


Leu 


Pro 


Met 


205 










210 


He 


Pro 


Ser 


Thr 


Cys 


Gin 


220 










225 


Gly 


Gly 


He 


Val 


Thr 


Asp 


235 










240 


Trp 


Glu Asp 


He 


Glu 


Ser 


250 










255 


Pro 


Val 


Glu 


Glu 


Pro 


Glu 


265 










270 


Lys 


Val 


Leu 


Trp 


Asn 


Asp 


280 










285 


Leu 


Leu 


Cys 


Trp 


ser 


Val 


295 










300 


Phe 


Gin 


Val 


Val 


Asn 


Tyr 


310 










315 


Pro 


Ser 


Arg 


Tyr 


Ala 


Ala 


325 










330 


Ser 


Thr 


Leu 


Leu 


Gly 


Ala 


340 










345 


Lys 


lie 


Ser 


Trp 


Ser 


Thr 


355 










360 


Ser 


Leu 


Leu 


lie 


Ala 


Ala 


370 










375 


Asn 


lie 


Trp 


Val 


Cys 


Tyr 


385 










390 


Tyr 


Met 


Leu 


Leu 


lie 


Thr 


400 










405 


Leu 


Ser 


Met 


Glu 


Arg 


Tyr 


415 










420 


He 


Ala 


Leu 


Ala 


Leu 


Gin 


430 










435 


Ala 


Ser Gly Leu 


Gly 


Leu 


445 










450 


Ala 


Ser 


Tyr 


Phe 


Ala 


Leu 


460 










465 


Ala 


Val 


Ser 


Val 


Met 


Lys 


475 










480 


Ser 


Ser 


Ser 


Gin 


Val 


Thr 


490 










495 



<210> 38 
<211> 228 
<212> PRT 

<213> Homo sapiens 
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<220> 
















<221> misc_ 


feature 










<223> Incyte ID 


No: 


3438320CD1 




<400> 38 














Met Pro 


Arg 


Arg 


Gly 


Leu 


Val 


Al » 


Gly 


1 






5 








Gin Arg 


Arg 


Tvr 


Phe 


Th-r 

X 111. 


pro 


AT a 

Aia 


Glu 








20 










Pro Glu 


Asp 


Leu 


Trp 


Val 


Ser 


Tyr 


Leu 








35 








Thr Ser 


Leu 


nla 


Gin 


VjXU 


Tyr 


Lys 


Gly 








50 








He Val 


Glu 


val 


Ala 


oxy 


Cain 


Asp 


He 








65 








Lys Thr 


Arg 


Asp 


lie 


Arg 


Lys 


His 


He 








80 










Leu Arg 


Tyr 


Cys 


Thr 


Pro 


Arg 


Gly 


Arg 








95 










Gin Leu 


Pro 


Cys 


Ser 


Asp 


Trp 


Ala 


Asn 








110 










Trp Gin 


Gly 


Ser 


Tyr 


Tyr 


Glu 


Val 


Gly 








125 








Arg Ser 


He 


Arg 


lie 


He 


Asn 


Thr 


Leu 








140 










Leu Glu 


Val 


Gly 


Val 


Leu 


Glu 


Ser 


lie 








155 










Tyr Leu 


Pro 


Tyr 


Asn 


Ser 


His 


Ala 


Ala 








170 










Glu Gly 


Lys 


Asn 


Leu 


Asn 


Met 


Asp 


Phe 








185 








He Arg 


Asp 


Glu 


Glu 


Glu 


Glu 


Phe 


Asp 








200 








Thr Leu 


His 


Thr 


Pro 


Ala 


He 


Leu 


Leu 








215 










Thr Glu 


Leu 














<210> 39 














<211> 273 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_feature 










<223> Incyte ID 


No: 


3986488CD1 




<400> 39 














Met Ala 


Ala 


Thr 


He 


Met 


He 


Leu 


Tyr 


1 






5 








He He 


His 


Phe 


Pro 


Asp 


Phe 


Asp 


Lys 








20 










Phe Pro 


Leu 


Pro 


Leu 


Leu 


Tyr 


Val 


Gly 








35 








Ser Ser 


Thr 


Ser 


Lys 


Leu 


Ser 


Leu 


Pro 








50 










Lys Phe 


Thr 


lie 


Pro 


Leu 


Thr 


Leu 


Leu 








65 










Gly Lys 


Gin 


Tyr 


Ser 


Leu 


Asn 


lie 


lie 








80 










He Leu 


Gly 


Ala 


Phe 


He 


Ala 


Ala 


Gly 








95 








Leu Glu 


Gly 


Tyr 


He 


Phe 


Val 


Phe 


Leu 








110 










Ala Asn 


Gly 


Val 


Tyr 


Thr 


Lys 


Gin 


Lys 








125 










Gly Lys 


Tyr 


Gly 


Val 


Leu 


Phe 


Tyr 


Asn 








140 










Pro Thr 


Leu 


He 


lie 


Ser 


Val 


Ser 


Thr 
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Pro 


Asp 


Leu 


Glu 


Tyr 


Phe 


10 










15 


Val 


Ala 


Gin 


His 


Asn 


Arg 


25 










30 


Gly Arg Val 


Tyr 


Asp 


Leu 


40 










45 


Asn 


Leu 


Leu 


Leu 


Lys 


Pro 


55 










60 


Ser 


His 


Trp 


Phe 


Asp 


Pro 


70 










75 


Asp 


Pro 


Leu 


Thr 


Gly 


Cys 


85 










90 


Phe 


Val 


His 


Val 


Pro 


Pro 


100 










105 


Asp 


Phe Gly Lys 


Pro 


Trp 


115 










120 


Arg 


Leu 


Ser 


Ala 


Lys 


Thr 


130 










135 


Thr 


Ser 


Gin 


Glu 


His 


Thr 


145 










150 


Trp Glu 


lie 


Leu 


His 


Arg 


160 










165 


Ser 


Tyr 


Thr 


Trp 


Lys 


Tyr 


175 










180 


Thr 


Leu 


Glu 


Glu 


Asn 


Gly 


190 










195 


Tyr 


Leu 


Ser 


Met 


Asp 


Gly 


205 










210 


Tyr 


Phe 


Asn 


Asp 


Asp 


Leu 


220 










225 



Val 


Ser 


Lys 


Leu 


Asn 


Lys 


10 










15 


Lys 


lie 


Pro 


Val 


Lys 


Leu 


25 










30 


Asn 


His 


He 


Ser 


Gly 


Leu 


40 










45 


Met 


Phe 


Thr 


Val 


Leu 


Arg 


55 










60 


Leu 


Glu 


Thr 


He 


He 


Leu 


70 










75 


Leu 


Ser 


Val 


Phe 


Ala 


He 


85 










90 


Ser 


Asp 


Leu 


Ala 


Phe 


Asn 


100 










105 


Asn Asp 


lie 


Phe 


Thr 


Ala 


115 










120 


Met 


Asp 


Pro 


Lys 


Glu 


Leu 


130 










135 


Ala 


Cys 


Phe 


Met 


He 


He 


145 










150 


Gly Asp 


Leu 


Gin 


Gin 


Ala 



33/60 
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155 










Thr 


Glu 


Phe 


Asn 


Gin 
170 


Trp 


Lys 


Asn 


vai 


Leu 


Leu 


Ser 


Cys 


Phe 


Leu Gly 


Phe 


Leu 










185 










Leu 


Cys 


Ser 


Tyr 


Tvr 
200 


Asn 


Ser 


Ala 


Leu 


Ala 


He 


Lys 


Asn 


Val 


Ser 


Val 


Ala 


Tyr 










215 








Gly 


Asp 


Tyr 


He 


Phe 
230 


Ser 


Leu 


Leu 


Asn 


Cys 


Met 


Ala 


Gly 


Gly 


Leu 


Arg 


Tyr 


Ser 










245 








Gin 


Leu 


Lys 


Pro 


Lys 


Pro 


Val 


Gly Glu 










260 










Leu 


Lys 


Ser 















<210> 40 

<211> 206 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 4378816CD1 

<400> 40 



Met Gly Ala 
i 


Glu 


Trp 
5 


Glu 


Leu 


Gly Ala 


Leu 


Leu 


Cys 


Ala 


Ala 


Leu 


Leu 


Ala 


Ala 










20 










Arg 


Leu 


Gly Arg 


Gly 
35 


Gin 


Gly 


Ala 


Ala 


Trp 


Leu 


Cys 


Tyr 


Asp 


Ala 


Leu 


Val 


His 










50 










Phe 


Val 


Tyr 


Leu 


Ser 


Leu 


Val 


Gly Asn 










65 










Leu 


He 


Ala 


Ser 


Leu 


Trp 


Lys 


Glu 


Tyr 










80 








Trp 


Val 


Tyr 


Phe 


Asp 


Pro 


Thr 


lie 


Val 










95 










Val 


Ala 


Leu 


Asp 


Gly 


Ser 


Leu 


Ala 


Leu 










110 










Val 


Lys 


Glu 


Lys 


Tyr 


Tyr 


Arg 


His 


Phe 










125 










Val 


Cys 


Glu 


Leu 


Tyr 


Gly 


Cys 


Trp 


Met 










140 








Leu 


Thr 


Arg 


Ser 


Pro 


Asn 


Leu 


Asn 


Thr 










155 










Trp 


Leu 


Tyr 


Leu 


Phe 
170 


Phe 


Phe 


Asn Gly 


Gly Leu Leu Leu 


Trp 


Gin 


Ser 


Trp 


Leu 










185 








Gin 


Lys 


Glu 


Thr 


Ser 


Ser 


Val 


Lys 


Lys 



200 
<210> 41 
<211> 235 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 4797137CD1 

<400> 41 

Met Gin Gin Arg Gly Ala Ala Gly Ser 

1 5 
Pro Leu Leu Gly Val Leu Phe Phe Gin 




PCT/USOO/16668 



i fin 










ICC 


Val 


Phe 


He 


Leu 


Gin 


rne 


j. f ~j 










ion 
loU 






Tyr 


Ser 


Thr 


Val 


190 










1 Q 1 ! 


Thr 


Thr 


Ala 


Val 


Val 


oiy 


205 










210 


He 


Gly 


He 


Leu 


He 


Gly 


220 










225 


Phe 


Val 


Gly Leu Asn 


He 


235 










240 


Phe 


Leu 


Thr 


Leu 


Ser 


Ser 


250 










255 


Glu 


Asn 


He 


Cys 


Leu 


Asp 


265 










270 



Glu 


Ala 


Gly Gly 


Ser 


Leu 


10 










15 


Gly 


Cys 


Ala 


Leu 


Gly 


Leu 


25 










30 


Asp 


Arg 


Gly Ala 


Leu 


He 


40 










45 


Phe 


Ala 


Leu Glu Gly 


Pro 


55 










60 


Val 


Ala 


Asn 


Ser 


Asp 


Gly 


70 










75 


Gly 


Lys 


Ala 


Asp 


Ala 


Arg 


85 










90 


Ser 


Val 


Glu 


He 


Leu 


Thr 


100 










105 


Phe 


Leu 


He 


Tyr 


Ala 


He 


115 










120 


Leu 


Gin 


He 


Thr 


Leu 


Cys 


130 










135 


Thr 


Phe 


Leu 


Pro 


Glu 


Trp 


145 










150 


Ser 


Asn 


Trp 


Leu 


Tyr 


Cys 


160 










165 


Val 


Trp 


Val 


Leu 


He 


Pro 


175 










180 


Glu 


Leu 


Lys 


Lys 


Met 


His 


190 










195 


Phe 


Gin 











205 



Arg Gly Cys Ala Leu Phe 
10 15 
Gly Val Tyr lie Val Phe 



34/60 
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20 










Ser Leu 


Glu 


lie 


Arg 


Ala 


Asp 


Ala 


His 








35 








Glu Lys 


tI- 
116 


Lys 


Leu 


Lys 


Cys 


Thr 


Phe 








50 










Thr Asp 


Lys 


Leu 


Thr 


lie 


Asp 


Trp 


Thr 








65 






Ser His 


Thr 


Val 


Ser 


lie 


Phe 


His 


Tyr 








80 








Thr Thr 


Ala 


Gly Thr 


Phe 


Arg 


Asp Arg 








95 










T T a 1 ITU 

Val Tyr 


Lys 


Gly Asp 


Ala 


Ser 


He 


Ser 








110 










Lys Asp 


Asn 


Gly Thr 


Phe 


Ser 


Cys 


Ala 








125 










Val His 


His 


Asn 


lie 


Pro 


Met 


Thr 


Glu 








140 










Gly Phe 


Gly 


Thr 


Met 


Leu 


Ser 


Ser 


Val 








155 










Val Phe 


Val 


Pro 


Ser 


Ala 


Val 


Val 


Val 








170 










Met Gly 


Arg 


Lys 


Ala 


Ala 


Gly 


Leu 


Lys 








185 








Tyr Lys 


Lys 


Ser 


Ser 


He 


Glu 


Val 


Ser 








200 










Glu Glu 


Glu 


Ala 


Cys 


Met 


Ala 


Arg 


Leu 








215 








Cys Leu 


Asp 


Ser 


Asp 


Tyr 


Glu 


Glu 


Thr 








230 










<210> 42 














<211> 147 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> raisc_: 


Eeature 










<223> Incyte ID 


No: 


5470806CD1 




<400> 42 














Met Ala 


Ser 


Leu 


Arg 


Leu 


Phe 


Leu 


Leu 


1 






5 










Phe Ala 


Ser 


Glu 


Ala 


Gly 


Pro 


Gly Gly 








20 










Pro Leu 


Met 


Val 


Lys 


Val 


Leu 


Asp Ala 








35 










Val Asp 


Val 


Ala 


Val 


Lys 


Val 


Phe 


Lys 








50 








Trp Glu 


Pro 


Phe 


Ala 


Ser 


Gly 


Lys 


Thr 








65 










His Gly 


Leu 


Thr 


Thr 


Asp 


Glu 


Lys 


Phe 








80 










Val Glu 


Leu 


Asp 


Thr 


Lys 


Ser 


Tyr 


Trp 








95 










Pro Phe 


His 


Glu 


Tyr 


Ala 


Glu 


Val 


Val 








110 










Gly His 


Arg 


His 


Tyr 


Thr 


He 


Ala 


Ala 








125 










Tyr Ser 


Thr 


Thr 


Ala 


Val 


Val 


Ser 


Asn 








140 










<210> 43 














<211> 147 














<212> PRT 














<213> Homo sapiens 










<220> 
















<221> misc_feature 










<223> Incyte ID 


No: 


5473242CD1 






PCT/USOO/16668 



25 










30 


Val 


Arg Gly 


Tyr 


Val 


Gly 


40 










45 


Lys 


Ser 


Thr 


Ser 


Asp 


Val 


55 










60 


Tyr 


Arg 


Pro 


Pro 


Ser 


Ser 


70 










75 


Gin 


Ser 


Phe 


Gin 


Tyr 


Pro 


85 










90 


He 


Ser 


Trp 


Val 


Gly 


Asn 


100 










105 


He 


Ser 


Asn 


Pro 


Thr 


He 


115 










120 


Val 


Lys 


Asn 


Pro 


Pro 


Asp 


130 










135 


Leu 


Thr 


Val 


Thr 


Glu 


Arg 


145 










150 


Ala 


Leu 


Leu 


Ser 


He 


Leu 


160 










165 


Ala 


Leu 


Leu 


Leu 


Val 


Arg 


175 










180 


Lys 


Arg 


Ser 


Arg 


Ser 


Gly 


190 










195 


Asp 


Asp 


Thr 


Asp 


Gin 


Glu 


205 










210 


Cys 


Val 


Arg 


Cys 


Ala 


Glu 


220 










225 


Tyr 












235 












Cys 


Leu 


Ala 


Val 


Leu 


He 


10 










15 


Ala 


Gly Glu 


Ser 


Lys 


Cys 


25 










30 


Val 


Arg Gly 


Ser 


Pro 


Ala 


40 










45 


Lys 


Thr 


Ala 


Asp 


Gly 


Ser 


55 










60 


Ala 


Glu 


Ser 


Gly 


Glu 


Leu 


70 










75 


Thr 


Glu 


Gly 


Val 


Tyr 


Arg 


85 










90 


Lys 


Ala 


Leu 


Gly 


He 


Ser 


100 










105 


Phe 


Thr 


Ala 


Asn 


Asp 


Ser 


115 










120 


Leu 


Leu 


Ser 


Pro 


Tyr 


Ser 


130 










135 


Pro 


Gin 


Asn 









145 
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<400> 43 



Met 


Val 


His 


Leu 


Thr 


Asp 


1 








5 




Trp 


Gly 


Lys 


Val 


Asn 
20 


Pro 


Ser 


Leu 


Leu 


He 


Val 
35 


Tyr 


Phe 


Gly 


Asp 


Leu 


Ser 
50 


Ser 


Val 


Lys 


Ala 


His 


Gly 
65 


Glu 


Leu 


Lys 


His 


Leu 


Asp 
80 


Asn 


Glu 


Leu 


His 


Cys 


Asp 
95 


Lys 


Leu 


Leu 


Gly Asn 


Met 


He 










110 




Lys 


Glu 


Phe 


Thr 


Pro 
125 


Ser 


Ala 


Gly 


Val 


Ala 


Ser 
140 


Ala 



<210> 44 
<211> 2701 
<212> DNA 

<213> Homo sapiens 



Ala 


Glu 


Lys 


Ala 
10 


Thr 


Val 


Val 


Glu 


He 


Gly 
25 


Ala 


Glu 


Pro 


Trp 


Thr 


Gin 
40 


Arg 


Tyr 


Val 


Ser 


Ala 


He 
55 


Met 


Gly 


Lys 


Val 


He 


Asn 
70 


Ala 


Phe 


Leu 


Lys 


Gly Thr 


Phe 


Ala 








85 






Leu 


His 


Val 


Asp 
100 


Pro 


Glu 


Val 


He 


Met 


Met 


Gly His 








115 






Ala 


Gin 


Ala 


Ala 
130 


Phe 


Gin 


Leu 


Ala 


His 


Lys 
145 


Tyr 


His 




PCT/US00/16668 



Asn 


Gly 


Leu 






15 


Ser 


Leu 


Ala 






30 


Phe 


Ser 


Lys 






45 


Asn 


Pro 


Gin 






60 


Asp 


Asp 


Gly 






75 


Ser 


Leu 


Ser 






90 


Asn 


Phe 


Arg 






105 


His 


Leu 


Gly 






120 


Lys 


Val 


Val 






135 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 264114CB1 



<400> 44 

gcggcggcgc cagcttcctc ggccggaggg 
gagggccacg cccccgacgc cgcgccggag 
agcggatccc acacctccgg accgagggac 
ggatgttgcc acaaaatcta cctcgtgtat 
gcagatttga gcacaatgtc tgcagactgt 
caggatgacc taggagtgat tctaagtctg 
gtcagtcttg gtactgaatg ctgcagataa 
tattcacttg accttccatc agtaagacgg 
tggaatacag tcatgtccac tcaagacgag 
ttgttggaac agttgaaact gttttatgaa 
gttgagggca ctgaattccc ttgtcataag 
agggccatgt ttatgagtgg actaagtgaa 
gtcgatgctg ccaccttaca gataataata 
aatgacagca ctgtagaaca gctttatgaa 
ttacaacgtt gtcgagaata tttaattaaa 
ttgagttttg ctgatctctt cagttgtgag 
gagcacaagt tcactgctgt gtatcatcag 
ctgatagata ttctcagtag tgacaattta 
gctgctatgc tgtggctaga gtataacaca 
cttagccaaa tcagaattga tgcactttca 
ctgccaccca atgataagtc agtggtggtt 
ttcaaaccaa gacttgggat gactaaagag 
gaaaatcctt gtagtcttta ctcttctgtc 
aagttatgta gcccaccagc tgatttgcat 
gatatctaca tagcaggggg tcaagttcct 
acaagcaaac ttcagactgc cttcagaact 
caaaatacct ggtttccaaa gaccccaatg 
tgctgtgaag gctatatcta tgcaattgga 
aggaccgtag aaagatacga cactgagaaa 
tgtgcttggc aatggagtgc agcagttgtg 
aacctcatgt actgttattt tccaaggtct 
actagtaggt cctttgcttc agctgcagct 
ttgcatattg ctaccaattc cggcataaga 
gtaactgtgg aaatttatga tgtgaataaa 
gctaagaggt actctgaccc ctgtgttaga 
tttatgcgag aaacccactt aaatgagcga 
gaacttgacc ggtggtctct gcggcagcat 



gaggcgagac cccaggcgag gccgcggcgg 60 
gggcccagtg tggacggggc caccggctgg 120 
gcggttactc cacaggatcc gctgaacata 180 
ttttctcttt cactcatgag ctgcacaatt 240 
gttgaaaaac tctgaagaac ctaattaaca 300 
tgtaacaaga tattactcat tagtgaatgt 3 60 
cagcaagtag gttctccttt atttctgaag 420 
acttttctaa tctgttcctg gagatattaa 480 
aggcagatca atactgaata tgctgtgtca 540 
cagcagttgt ttactgacat agtgttaatt 600 
atggttcttg caacatgtag ctcttatttc 660 
agcaaacaaa cccatgtaca cctgaggaat 720 
acttatgcat acacgggtaa cttggcaatg 780 
acagcttgct tcctacaggt agaagatgtg 840 
aaaataaatg cagagaattg tgtacgattg 900 
gaattaaaac agagtgctaa aagaatggtg 960 
gacgcgttca tgcagctgtc acatgaccta 1020 
aatgtagaaa aggaagaaac cgttcgagaa 1080 
gaatcacgat cccagtattt gtcttctgtt 1140 
gaagtaacac agagagcttg gtttcaaggt 1200 
caaggtctgt ataagtccat gcccaagttt 1260 
gaaatgatga ttttcattga agcatcttca 1320 
tgttacagcc cccaagcaga aaaagtttac 1380 
aaggttggga ccgttgtaac tcctgataat 1440 
ctgaaaaaca caaaaacaaa tcacagtaaa 1500 
gtgaattgct tttattggtt tgatgcacag 1560 
ctttttgtcc gcataaagcc atctttggtt 1620 
ggagatagcg taggtggaga acttaatcgg 1680 
gatgagtgga cgatggtaag ccctttacct 1740 
gttcatgact gcatttatgt gatgacactg 1800 
gactcatggg tagaaatggc catgagacag 1860 
tttggtgata aaattttcta tattggaggg 1920 
ctcccctctg gcactgtaga tgggtcttca 1980 
aatgagtgga aaatggcagc caacatccct 2040 
gctgttgtga tctcaaattc tctatgtgtg 2100 
gctaaatacg tcacctacca atatgacctg 2160 
atatctgaac gtgtactgtg ggacttgggg 2220 
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agagattttc 
aaaccaccaa 
atggttgcac 
ctttgtcatt 
aattctatct 
tatatcagtg 
ttgagaaact 
ccctgtgtat 
t 



gatgcactgt 
cttatctttt 
taccacctgt 
ttctttgcta 
ttgataaatt 
agtcttacag 
ttatgtgtaa 
gtacagttca 



ggggaaactc 
ttcaacggat 
atagtgggga 
aacaaaagag 
ttatttttat 
aaagatatgc 
tcatgagagt 
aaaagttcat 



tatccatcct 
gggacagaag 
agttcaggga 
gctatgaaag 
gccctactta 
ttccataata 
ataagaatct 
ttataaaagt 



gccttgaaga 
agtttgaact 
gtgcacgcct 
aactaaatat 
atatttgcat 
tgaaatagat 
ggattatcta 
agtttcctgt 



gtctccatgg 
ggatggagaa 
gagttatgtg 
gagtacataa 
cagtataata 
tattcaataa 
acattgttag 
tcctagttga 



2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2701 



<210> 45 

<211> 736 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1455669CB1 



<400> 45 

gagacttagc gacagacaga 
ctgctgagcc ctgcgcgggg 
ccagcctgct gagcctgtgc 
tctgggtcct tggtcccatc 
ggatgtcccc actcttcaaa 
gcaatgttcg ggggaggcag 
tcctatcggg gtgtagaagg 
gaggtcaact cagtggaaca 
catgcgcaaa catacgcata 
ggcattttcc aaaacatacc 
ttaggatgaa ccttttaaaa 
ataggacaat ttttatatgt 
tccaagtgtg aaaaaa 

<210> 46 
<211> 1826 
<212> DNA 

<213> Homo sapiens 



cgctgggacc cacgacgaca 
cagggggtct ggaaccagac 
ttcctgagaa cagcaggggt 
tacctcctct tcatccacca 
gccaagatgg tagctgccat 
gggacaggct ggaacctggt 
gaatctgtta atcaaacaga 
cggttctccc aaacagattt 
cactcccatg ttcctggaca 
atgtagactc ttggatacac 
agttatgcat ttatttttat 
gatatgtatt taccttagtg 



gaaggcgccg atggccgcgc 60 
agagcctgaa cctgccgcca 120 
ctgggtaccc cccatgtacc 180 
ccatggccgg ggctacctcc 240 
ccctgggagc ctggaaccag 300 
gaagtcttaa agtagactcc 360 
gcaatattag aaaggctaca 420 
tgtaattccg aaaatccacg 480 
gtttatagct accataacct 540 
aaggtaattt tagagccaca 600 
gttcccccac tggctgtatt 660 
tgttaaataa acactggcat 720 

736 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2084989CB1 



<400> 46 

ccttaggcgc cagggacagc cgagcgttac ctggtcccgg gcagcggagt tctttaccca 60 

ccccagttct ggttctgacg ccctagctca ttccgcaaat ttagggcttg ggtctggctt 120 

gttcccctcc ggctcgaacc acctcttctc tgagccgagc cagctaccgg ggctcctgga 180 

attgccaccc ctccctgggc acccttgagg cctccgtgga gggacgtcac ggggcagagc 240 

gggacgtgag cctgagtttg ctgcaggcgt gctctgtgtg gtggctgggt tctgccaatc 300 

cccgtgccca ccgggtgggc gcggccggga agctcctgcc cctccctgct ggtcggcgtc 3 60 

acgcgtgacg tcccgcgtga tggctgggag ggcccggcgg cgacagcgga ggcagagagg 420 

aaggcggttc tgagagcttc agagagcgat ggaaagcaaa atgggtgaat tgcctttaga 480 

catcaacatc caggaacctc gctgggacca aagtactttc ctgggcagag cccggcactt 540 

tttcactgtt actgatcctc gaaatctgct gctgtccggg gcacagctgg aagcttctcg 600 

gaacatcgtg cagaactaca gggccggcgt ggtgacccca gggatcaccg aggaccagct 660 

gtggagggcc aagtatgtgt atgactccgc cttccatccg gacacagggg agaaggtggt 720 

cctgattggc cgcatgtcag cccaggtgcc catgaacatg accatcactg gctgcatgct 780 

cacattctac aggaagaccc caaccgtggt gttctggcag tgggtgaatc agtccttcaa 840 

tgccattgtt aactactcca accgcagtgg tgacactccc atcactgtga ggcagctggg 900 

gacagcctat gtgagtgcca ccactggagc tgtggccacg gccctgggac tcaaatccct 960 

caccaagcac ctgcccccct tggtcggcag atttgtgccc tttgcagcag tggcagctgc 1020 

caactgcatc aacatccccc tgatgaggca gagagagctg caggtgggca tcccggtggc 1080 

tgatgaggca ggtcagaggc ttggctactc ggtgactgca gccaagcagg gaatcttcca 1140 

ggtggtgatt tcaagaatct gcatggcgat tcctgccatg gccatcccac cactgatcat 1200 

ggacactctg gagaagaaag acttcctgaa ggtaggcgac tgtacctctc ttgtcctgga 1260 

atgggcgatg gctgggagaa gtgaccaggc cccaactctc tctccagcct cgcctgattc 1320 

tctaagactt gccagccctt ctcctgaccc ctgcaccgcc tcctccacct tcgttcattc 13 80 

agcaagaatg aactgggctg gggtgaagga actctgcagg ggcaggagga gaggacaaag 1440 
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gaaggaaacc 
catagtcatg 
tacacacggg 
caagctctgg 
gcatgtttta 
aagtttatag 
agaaagtgta 



aacttcatca 
cttgatctca 
tgaacacgca 
ctcattcaca 
gggctgagga 
tctgatgcag 
aaaataaaaa 



gtgttactcc 
tgctcatcct 
cacacagaca 
gaatcattta 
gatggcactg 
aaaccaataa 
aaaaaa 



agtggcttct 
tttaccccct 
tgaacaggac 
ttcacaaatg 
aacacaatgg 
acaaggaggc 



gacacacaga 
agtgcctcca 
acgaaagcaa 
tattgagtgc 
ttatggcccc 
acccacataa 



aggggactgt 
tactgagagg 
agcacaggaa 
catgcaccag 
tgtcctcatg 
atacattctt 



1500 
1560 
1620 
1680 
1740 
1800 
1826 



<210> 47 

<211> 1325 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2501034CB1 



<400> 47 

ccacgggtcc 

tgcagcaggg 

cctgccagta 

accacggtgt 

tcgagtatga 

gacatattct 

tttgtgaagc 

atgacttgat 

tagaacaaac 

cggcaaaaaa 

gaaaactctt 

aactcattaa 

tcaaccgtaa 

tctacttctt 

aggacttctg 

agagaaaaga 

gattggaaac 

aaatgcaccc 

ccacatggac 

atacgttata 

ttcattcctt 

tttgtgttct 

aaaaa 



ggttctggac 
gaaggggaac 
caggtgcatc 
gctgaagata 
tcctgttatc 
tttcaatact 
atacctgagg 
cagatatgct 
tgcctatttt 
aaagcccaac 
gctgattcca 
agtatttgtt 
gctctccttc 
ccagagtatg 
aagagtttta 
catgaaatat 
actttctgaa 
aagagacaag 
acaaggaatg 
cacaaggcca 
tgacatgttt 
tacaatagaa 



tgcagttgag 
gtggatgggg 
gaatgcaacc 
accatctgta 
atcttgatta 
caaataaata 
tggtggcagc 
aaggaatggg 
attggcattt 
ttcattttgc 
gctgtcattt 
cttacatcaa 
ttggccgtgt 
gaatgggatg 
ttcttcttca 
aaaccaacct 
aaagaaagca 
ccatttacat 
ttgctgcaga 
ggtaagcgct 
atatcttttt 
acgcttttaa 



tggaaatggg 
tggcagcgac 
aggaggccaa 
aatcctgcca 
atgctatatt 
tccatggaaa 
ttcaagattc 
atttctatag 
ttaccttcct 
tgctgaaagc 
gggaacatga 
attttcaggc 
tgagtggctt 
ttggaagtga 
ctatctgtgg 
cctcatttct 
atgataatag 
acagatattc 
gactgaatga 
cataattcac 
aatttaaatg 
taaagtcttt 



caacggcggg 
tcctactgct 
agagttgtac 
gaaacctgta 
gtgcaaagct 
actctgcata 
caaccagaat 
aatgtttgcg 
gtgggtagaa 
attattatta 
ctacacatct 
aattagagtg 
actgctggaa 
ttatgccatc 
catgaccagc 
gttgagtaaa 
cggtggatac 
acagtcacac 
catgcaacag 
acataataaa 
ttgttactgg 
cagaataaac 



cggagcggcc 
gcctcggcct 
cgagactata 
gacaaatata 
caggcctaca 
ttttgtttgc 
actgcccctg 
attgctgctt 
cggcccatga 
tctagctacg 
gtgtgcctca 
accctaaaca 
agcatcatgg 
tttaaatctc 
tgtatctgaa 
atgaagcaaa 
ccacccccac 
atagaaacac 
gtgaaggttt 
acatctaggt 
cttaaaatat 
caaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1325 



<210> 48 
<211> 1832 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2745212CB1 



<400> 48 

tgggctgtcg 

gctggcaaat 

gttgggtggg 

cgaatcactt 

gagtgctgtg 

aggacaaaca 

gggttccgga 

ttcaaaaagc 

agaagacatc 

aaacacctgg 

aaagggcccg 

ttaaagtaaa 

ccaggcaaaa 

attatgcaaa 

aagttgtggc 

aatttgagag 



ttggctggag 
ccggcccagg 
gtagagagta 
ataaatggcg 
ctttcatggg 
agtgaaatac 
gagcagagta 
caatcaggag 
tggtctgcaa 
aaatggagat 
ggtagatcct 
gattcctgaa 
acagctcttt 
ttacaagaaa 
agggataaaa 
accacagtat 



cagcggctgc 
atgtagagct 
gggggcggta 
ccgaagcagg 
cctcttcttt 
ttcatacatt 
ctcaaatacg 
cagtatgcag 
cagaaaaatg 
ggtggcagta 
actgttgaaa 
gagctaaaac 
tatcttcctg 
tctcgtggaa 
gaatacttca 
gctgaaattc 



gcgggtcgcg 
ggcagtgcct 
gtcgggggtg 
acccgaagcc 
atgaagcaaa 
acagtggttg 
tggacaccaa 
aggggaagat 
ttgaagtgaa 
ccagtgagac 
atgaggaaac 
cgtggcttgt 
ccaagaagaa 
acacagataa 
acgtaatgtt 
ttgcagatca 



gtgctgtgag 
gacggcgcgt 
gtgggagaag 
taaattccag 
gtgtgtaaag 
gaataaaaat 
tttgcagaaa 
gagaggggct 
aacgaaaaag 
ccctcagcct 
attcatgaac 
tgatgactgg 
tgtggattcc 
taaggagtat 
gggtacccag 
tcccgatgca 



gtctgcgggc 60 
ctgacgcgga 120 
gaggaggcgg 180 
gagggtgagc 240 
gttgccataa 3 00 
tgggatgaat 3 60 
cagcgagaac 420 
gccccaggaa 480 
aacaaacaga 540 
cctcggaaga 600 
agagttgaag 660 
gacttaatta 720 
attcttgagg 780 
gcggttaatg 840 
ctactctata 900 
cccatgtccc 960 



38/60 




WO 00/78953 PCT/US00/16668 



aggtgtatgg agcgccacat ctcctgagat tatttgtacg aattggagca atgttggctt 1020 
atacacctct ggatgagaag agccttgctt tattactcaa ttatcttcac gatttcctaa 1080 
agtacctggc aaagaattct gcaactttgt tcagtgccag cgattatgaa gtggctcctc 1140 
ctgagtacca tcggaaagct gtgtgagagg cactctcact cacttatgtt tggatctccg 1200 
taaacacatt tttgttctta gtctatctct tgtacaaacg atgtgctttg aagatgttag 1260 
tgtataacaa ttgatgtttg ttttctgttt gattttaaac agagaaaaaa taaaaggggg 1320 
taatagctcc ttttttcttc tttctttttt tttttcattt caaaattgct gccagtgttt 1380 
tcaatgatgg acaacagagg gatatgctgt agagtgtttt attgcctagt tgacaaagct 1440 
gcttttgaat gctggtggtt ctattccttt gacactacgc acttttataa tacatgttaa 1500 
tgctatatga caaaatgctc tgattcctag tgccaaaggt tcaattcagt gtatataact 1560 
gaacacactc atccatttgt gcttttgttt ttttttatgg tgcttaaagt aaagagccca 1620 
tcctttgcaa gtcatccatg ttgttactta ggcattttat cttggctcaa attgttgaag 1680 
aatggtggct tgtttcatgg tttttgtatt tgtgtctaat gcacgtttta acatgataga 1740 
cgcaatgcat tgtgtagcta gttttctgga aaagtcaatc ttttaggaat tgtttttcag 1800 
atcttcaata aattttttct ttaaatttca aa 1832 



<210> 49 
<211> 1211 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 4833111CB1 



<400> 49 

gacagacccg 

cggcttggat 

agcgctggtg 

taccattaaa 

aatatatgct 

tatcacctat 

tatgaaacat 

atctgaagtg 

ctctaacatc 

tttaagagag 

ctggtcctgg 

tgcaggtgga 

gctggcaaag 

ctggcggtca 

cagtctggga 

agttggcaga 

gggcctgcag 

gaagaccagt 

tggtatgaag 

ggtactctga 

aaaaaaaaaa 



cctcaaacat 
tgtagccttg 
gctggtgggg 
accaggctgc 
ggcgttcctt 
gaatatgtga 
atgttggctg 
gttaagcaga 
tcatatgaag 
attccttttt 
aggcaggatc 
tttgccgctg 
gctggctcca 
caggggctgg 
ggtttcatct 
aagagtcctt 
tgcaaaccct 
tgtgctaaga 
tcattggcct 
acaatttcct 



ggcggcgccc 
acgaggtctg 
tagcaggtgt 
agagtcccca 
ctgctgctat 
agtggttttt 
cctctgctgg 
gggcacaggt 
agggtatcca 
ctttggtcca 
atgtggtgga 
cagtcaccac 
gcactgctga 
caggattatt 
ttctgggggc 
gaagcagaga 
cttccgctga 
taccggcatg 
gtatgccaga 
cagaacctct 



agcgcgcgag 
agcgaccatg 
ttctgttgac 
aggatttagt 
tggatccttt 
gcatgctgat 
agaagtggtt 
atctgcttct 
agggttgtat 
gtttccctta 
ttcttggcag 
ccctctagac 
tgggaatgtg 
tgcaggtgtc 
ttatgaccga 
caagcctcac 
gcagctgtct 
gagattgtgc 
gagctaagag 
taataaataa 



gacgtgatcc 
gaccggccgg 
ttgatattat 
aaggctggtg 
cctaatgctg 
tcatcttcgt 
gcctgcctga 
acaagaacat 
cgaggctata 
tgggagtcct 
tcagcagtct 
gtggcaaaga 
ctctctgtcc 
ttccctcgaa 
acgcacagct 
ctccacttct 
gaactatagg 
catccgtggt 
aagaaaacgg 
gtttggtaat 



gcttctgctc 
ggttcgtggc 
ttcctctgga 
gttttcatgg 
ctgcattttt 
atttgacacc 
ttcgagttcc 
ttcagatttt 
aaagcacagt 
taaaagccct 
gtggagcttt 
caagaattac 
tgcatggggt 
tggcagccat 
tgctgttgga 
gtcaagagag 
ccccagtgct 
ataggctggc 
ggtctgtggc 
gctgagaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1211 



<210> 50 

<211> 1046 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 876677CB1 



tcatactttc tcctctcccc tctcccaagc 60 
tagctccaaa cccatgaaaa attgccaagt 120 
tagggtgtct tcacctgaga agcaagataa 180 
gcttggtgta tgtggctgga tcctgttttc 240 
ccccatctcc atatggatgt gcttgaagat 300 
ccgtctggga cgcatccaag ctgacaaagc 3 60 
atgcatagat gtgtttgtca aagttgacct 420 
agagatcctc accagagact ccgtaactac 480 
ctatagtgct gtctcagcag tggctaatgt 540 
ggctcaaacc actctgagaa atgtcttagg 600 



<400> 50 

cccacgcgtc 

acatctgagt 

ataaaagctt 

agagaatttc 

cctctctttc 

cattaaggag 

caaggggcca 

ccgaacagtt 

tcaggtagat 

caacgatgtc 



cgggaatgtc 
tgctgcctgt 
ctcaagaatg 
gtgggtgtca 
ctgttggtga 
tatgaacgtg 
ggtttgatcc 
acttgcaaca 
ggagttgtct 
catcaagcaa 



tttcactctc 
tcttcacact 
agatggattc 
acaataaacg 
tcattacctt 
ctgttgtatt 
tggtcctgcc 
ttcctccaca 
attacagaat 
catttctgct 



39/60 



WO 00/78953 



PCT7US00/16668 



gacacagacc 
tttacttgat 
tgttcggatt 
agcgagagcc 
agcctccatg 
gagcacggta 
agagggcatt 
tcctcttgcg 



ttgtcccaga 
gatgccaccg 
cccgtgcagt 
aaggtccttg 
gtgctggctg 
gccaccgaga 
ggtggcgtca 
gtagtcagct 



tcttagctgg 
aactgtgggg 
tgcagagatc 
cagctgaagg 
agtctcccat 
agaattctac 
gctatgataa 
attgca 



acgagaagag 
gatccgggtg 
catggcagcc 
agaaatgaat 
agctctccag 
gattgtgttt 
ccacaagaag 



atcgcccata 
gcccgagtgg 
gaggctgagg 
gcttccaaat 
ctgcgctacc 
cctctgccca 
cttccaaata 



gcatccagac 660 
aaatcaaaga 720 
ccacccggga 780 
ccctgaagtc 840 
tgcagacctt 900 
tgaatatact 960 
aagcctgagg 1020 
1046 



<210> 51 
<211> 1660 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 
<223> Incyte ID No: 



2326143CB1 



<400> 51 

gctcccctgc 

cggcgcgggc 

gccgccaccg 

aaaggaaatt 

gcatcttgtt 

tgcatattaa 

cagccacagt 

caagaacatg 

gcagcgttac 

cttgaacaca 

atgtttcacc 

actagcagtt 

gaagccatgg 

ggaggacagc 

ctgcagctag 

gcaacccggc 

accaacatag 

acaaggttga 

gcagaccgca 

tcatcctcag 

gatattatgc 

aatgagcctc 

ctgtatttgc 

caggtctgtc 

taatcatttc 

ggtgatctct 

actgaagagg 

aataaaaact 



ccaccccgcc 
agccgtgacc 
tctgaactag 
ttcgaggtcg 
atgaaagtgg 
caagggtaga 
cttttacttg 
ttacttctga 
ctggaggcga 
gagcccctag 
ctggccgggg 
ctactggtgg 
atcctatagc 
ttaattcctc 
aacggcagca 
gtactaacac 
ctaatacaga 
atgatcctaa 
gcctgtttgt 
atgaggatga 
ctttagatgt 
caccacctcc 
caatgaaagt 
actcttgtta 
cacttaacta 
gcttcctgta 
aaaattgaaa 
gcttggatgg 



cccgtggccg 
cccgcggggg 
gatgtcccga 
cagatataag 
tgcaacaaca 
ttttgattta 
tccctattgt 
acatgcagaa 
tcctaatcat 
agatttagat 
attaggaggt 
actttcttct 
tgagctttta 
tggcccttcc 
tgcccaggca 
aagcagtgtc 
aagcagtcag 
aatgtctgaa 
ccaagagctc 
tcggggggag 
tgctttagaa 
tctttgatga 
ggacaacaac 
cattgtgtac 
atttttactt 
ccttgacatg 
tctctaatga 
tttaccttaa 



agcccgggag 
acactgcagc 
catgaaggtg 
tgtttaattt 
acaaggcata 
tactatggtg 
ggaaaaatgg 
acatcaacag 
gtcacggatg 
gaatcgagtg 
cctcgtgctc 
tctcagagtt 
tctcagttat 
gcttctcagt 
gcacggcaac 
accactacaa 
cagactctac 
acggagcgcc 
cttctgtcca 
atggcagatt 
aacctaaatt 
catcccaatt 
tatcttgggt 
attcaaaagg 
ctagcaggta 
caaaaggctc 
agctgctgtg 
aaaaaaaaaa 



tcgagtggga 
cggagcccgg 
tcagctgtga 
gctacgatta 
caactgacca 
gggaagcttt 
gctatacgga 
aagtgatttg 
actttgcagc 
gtgttcgaca 
gtagatcaaa 
catattctcc 
caggagtgag 
tacaacaact 
aactggagac 
tcacacaatc 
agaattccca 
agtccatgga 
ctttagtgcg 
ttggtgctat 
taaaagagag 
cgcagacaat 
ttgtttggtg 
aagagagaaa 
aatgtaggta 
tcctaatact 
tgtatttatg 



gtcggccggc 
gaggggccgc 
tgcatgttta 
cgatctttgt 
cccaatgcag 
ctctgtagag 
gacatctctt 
tccaatatgt 
tcatcttaca 
tgtacgtaga 
catgcacttt 
aagcaatagg 
acgttctgca 
gcagatgcag 
cgcacgcaac 
cacagcaaca 
gtttctttta 
aagcgagcgt 
tgaagagagc 
gggctgtgta 
taataaagga 
gtcctctgtg 
attgtaattt 
atatatatga 
gcagtgcagg 
ccacattcaa 
aatattaatg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1660 



<210> 52 
<211> 1110 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 2786302CB1 



<400> 52 

cctttattag agaaagagtg acctatctac 
cagtttcttc tgtcttctgc tcctggacat 
ggataaagaa taactctagg acaagagttg 
tttagatttg gctttaaaag cccatactat 
ttttaaaaaa attaatgtgg gctttttaat 
aacctttctt ccacatgaga agtattttac 
tcaaatcttc aagaaaatga tcaaatcttc 
ttgagtgtga atgccatggt gatgctaatt 
cttcagagtg ttattttcct ttttgtcata 
ataatgaata aagacccagt tttcagaatc 



cccttacttg gggaagactc tctgcttcaa 60 
ggcatgaaaa cggaagtttg agtatttcaa 120 
ctttggggct aaacctatct cctgtattca 180 
tttattagct cttctatgct ttttaaaagt 240 
tctttctcag tgggagggtg agtttgaata 300 
aagttgcttg tcaaatttaa aagaaaatga 360 
tgtgacaaaa aaatggacaa atattcacca 420 
ctgattattt ttcttttctt ttagccatac 480 
aggtgtctgg aaatgaagta tggaaatgaa 540 
tctccacgga gtagagaaac tcatcccaat 600 



40/60 
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ccggaagagc 
gcactcactg 
gaatattatg 
aatgtttcct 
ggtaaaagta 
gtgttacaag 
gggtactgcc 
ttgtatgcag 
ggtgggaggt 



ccgaagaaga 
ctccaaactt 
agacaaagaa 
tttgtgttaa 
cttccattaa 
gcagcagagc 
ctcaggagaa 
ctgtggaaag 
cttaagctcc 



agatgaagat 
ggaggaggaa 
aagttgcttt 
aaaaggtgaa 
aatgataact 
atcagtaagg 
ctcactgtgg 
actggggcaa 
aggaacaact 



gttcaagctg 
ccagtcataa 
tcaacaagaa 
gttttgggat 
gggtgcacaa 
caacagcatg 
cccaagctta 
aaaagatgct 



aaagagtcca 
ctgcaagctg 
agaagaaaat 
tactaggaca 
agccaactgc 
acaacagcct 
ccatgaaaga 
gctctcagta 



agcagcaaat 
tttacacaag 
agccatcaga 
caatggagct 
aggagtggtg 
caagttcttg 
gcacttggag 
tttcacgatt 



660 

720 

780 

840 

900 

960 

1020 

1080 

1110 



<210> 53 
<211> 1120 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 3735780CB1 



<400> 53 

gaaatccagt tatcaaaatt gactcaagaa gagagaacct aacagaacaa taacaatgga 60 
agaaattggg aacattatca caaagctatc atcctgccaa actccaggct cagatgtcac 120 
aggttaaaaa aagtccttca tgaaaaagaa agatcttaag cagcatgatg gattcagaag 180 
ctcatgaaaa gaggccacca atactaacat cttcaaaaca agatatatca cctcatatta 240 
caaatgttgg tgagatgaag cattacttgt gtggctgctg tgcagccttc aacaatgtcg 300 
caatcacatt tcccattcag aaggtcctct ttcgacaaca gctgtatggc atcaaaaccc 360 
gggatgcaat acttcagttg agaagggatg gatttcgaaa tttgtatcgt ggaatccttc 420 
ccccattgat gcagaagaca actacgcttg cacttatgtt tggtctgtat gaggatttat 480 
cctgccttct ccacaagcat gtcagtgctc cagagtttgc aaccagtggc gtggcggcag 540 
tgcttgcagg gacaacagaa gcaattttca ctccactgga aagagttcag acattgcttc 600 
aagaccacaa acatcatgac aaatttacca acacttacca ggctttcaag gcactgaaat 660 
gtcatggaat tggagagtat tatcgaggct tggtgcccat tcttttccgg aatggactca 720 
gcaatgtctt gtttttcggc cttcgaggtc ccattaagga gcatctgcct accgcaacga 780 
ctcacagtgc tcatctggtc aatgatttta tctgtggagg tctattgggt gccatgttgg 840 
gattcttgtt ttttccaatt aatgttgtaa aaactcgcat acagtctcag attggtgggg 900 
aatttcagtc tttccccaag gttttccaaa aaatctggct ggaacgggac agaaaactga 960 
taaatctttt cagaggtgcc catctgaatt accatcggtc cctcatctct tggggcataa 1020 
tcaatgcaac ttatgagttc ttgttaaagg ttatatgaaa aaaccatcag ttaagtgcca 1080 
tttatcaact gaatagacct tctaagaaga aaaaaaaaaa 1120 

<210> 54 
<211> 886 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 039026CB1 



<400> 54 

ggccgcggct cctgtccaga ccctgaccct 

ccgccagcct tgtactgatg tcggatgcga 

attggagaca ttcaagcaaa ggttggacaa 

gcatcagaaa aggtgactaa taaaggtacc 

ctgatcagat aaaacagttt aaggaatttc 

gctttttgga ctgtgttaaa gacttcacaa 

gttcagaaca ttgcttacag aaatatttaa 

aggaatatca tattcagcag aatgaagccc 

cacgatagag aagtcctgat ggatgaactt 

tggaaatgag gactcatctg atagaatccc 

ctgtcatgac tgtttggcaa atggaaaccg 

ataatcacaa tagaaggtct tattgttcag 

ccttatgatt cagcagcttg gtcacttgat 

tgactgttaa ttttaaagca acttatgtgt 

ttactcaaag taaaataaat ggaaatatca 

<210> 55 
<211> 2336 



ccctcccaag gctcaaccgt cccccaacaa 60 
gagcctgtgc ttaagtaaga atcaggcctt 120 
ctacttttcc agaacagaaa ggaaactcat 180 
agaagaatat ggctgcacaa ataccagaat 240 
tggggaccta caataaactt acagagacct 300 
caagagaagt aaaacctgaa gagaccacct 3 60 
aaatgacaca aagaatatcc atgagatttc 420 
tggcagccaa agcaggactc cttggccaac 480 
ttgatgaaag attgccaaca gctgctttat 540 
ctgaaagcag tagccaccat gttcaaccat 600 
ctggagaaac aaaattgcta tttaccagga 660 
tgaaataata agatgcaaca tttgttgagg 720 
tagaaaaata aaccattgtt tcttcaattg 780 
tcgatcatgt atgagataga aaaattttta 840 
ctgaaaaaaa aaaaaa 886 



41/60 



WO 00/78953 



PCT/US00/16668 



<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 260607CB1 



<400> 55 

taatacgctc 

tttaggatat 

gactttgaag 

gttgcctttt 

gccactttta 

ccttatcaat 

ccttgtttat 

gggggatgct 

ggctgagaga 
tctgaggagg 
ctcctggggg 
tctgtcggct 
cctgttgtgg 
ttcaggcctg 
cctcctccca 
ctcacagagc 
ctaggcaacc 
aagtgtttat 
ttcttgaaaa 
aagaaaaatc 
aatttagatg 
agtttctttc 
tcatgtctga 
cgctttgggg 
ggacaaaaga 
gtgttttgat 
cttacagtaa 
ggacaagtga 
ccaggatccc 
ctttccttgc 
gctttgttgg 
tggagtgaga 
ggggaaacca 
ggcccttttt 
ggatcctttg 
gtagttgggg 
tcctcataga 
attaggtagt 
aggaaaaaaa 



actataggga 
atttttaaat 
ctgcatctgc 
gcagtttggg 
aggacaggga 
cttttttcct 
gcctcactat 
ctagttggtt 
ggctgttcct 
aaggcctcct 
attagggtgt 
cctggttgca 
gactgctctg 
gcagacgggc 
acccagacat 
cccagagggc 
ccttcctccc 
ggggagatcc 
ccagtagaag 
ctcatcaatg 
cagaacattt 
ctttggatac 
agggggattt 
cacagtctga 
aaaaaaattt 
taaccgtagt 
gattggaact 
gttgaggctt 
ccccatatcg 
ccatgtgtca 
accagatggt 
cctggggctt 
gagctgagac 
cagtcttcaa 
ccctggggga 
gagttccact; 
agccccattt 
ggcagctcca 
tattaccttg 



atttggccct 
tctxtgaaac 
cagttacacc 
ttgtggactc 
gttttatagc 
ttttcctgac 
tggcagaaaa 
ctgtgtccat 
gaatcaaagc 
ggaaactiggg 
ggtgtttgcc 
ctttggggaa 
tcccctgccc 
agcgaggagg 
accctctgcc 
ctgcttgagc 
acctctcatc 
tagtggcttt 
ggaaacagca 
ccaggggaca 
ctctgtattc 
ataaggcttc 
aggggcgaga 
tggcgccttt 
tttttcttga 
gatggatgct 
aagggcaggg 
gtgccacaaa 
attgggctgg 
cccctttggc 
gaggaaagga 
ggcctggaac 
ctcttcaaca 

tgggggccca 

gtctgagggc 
gggcgatccc 
cacttttgct 
cttgcttagg 
agaatacttt 



cgagcagtaa 
acataaccaa 
ccaaatggct 
agctcctgtg 
ccttttctac 
cccctccttc 
gaccccattt 
tttcctctgt 
aatagccagc 
agctaagggc 
aagccaaggg 
ggagaggaag 
ctactgcaga 
ggcctcagtt 
aaactgggaa 
cttcttgctc 
aggggtgggg 
gccattcaaa 
cagcctgtca 
taaaagccat 
agacttagag 
tctatcgggg 
gccgcagccc 
gctggcgcct 
atgaaatagc 
acgagtataa 
actcatgcat 
aggtttgtcc 
gagggctggc 
ctccagcttg 
atggcctctt 
ccaccacaca 
gggtttcttt 
gttggctcta 
gcggtccttg 
agcccctccc 
ttatctacct 
ttaggggggg 
ccaaaaaata 



ttcggcacga 
aatggtttga 
ttaatcccct 
aggggtccgg 
tttcctcccc 
tggaggcagt 
aaaacccaga 
gccaaagaca 
tttcgacaca 
gaggcccttc 
gtagggagcc 
tttggggctc 
gatagcactg 
agctctcaag 
ccagcagtgc 
cacaggagaa 
gttctccttt 
ccactcgact 
cagtaattgc 
ttccctxcca 
taacaccagc 
tacgggacag 
tgaccctcgg 
tagtatggtt 
aggaagctcc 
atggattaac 
aagggtatga 
ttggggaaca 
cgtgaggtcc 
tccctctctc 
cccttctaga 
gccccaaagt 
gagatcctac 
gaaggagaag 
gactcattca 
cacccaccct 
cctagcaaaa 
aaaaagattt 
aaatttaaaa 



ggaccatctc 
ttcactgact 
ctcgggtctg 
ttaggagaga 
tcctcccagt 
tgggagctat 
gaacactgga 
gacagacaga 
tacctggctg 
ccttcagagg 
gagaaattgg 
caggtagctc 
ccgagttccc 
ggtgccttcc 
tagtaactac 
gctggtgcct 
ctttcccctg 
gtttgcctgc 
aggaagattg 
aatactcgac 
tgaaaactgc 
ggaggaggcc 
tcctgtgcac 
gactccggat 
tcgggagcat 
tacctcaatc 
atcccagcca 
ggcaggcctg 
ccactttctg 
actttctata 
gggggctggc 
caggaagcct 
acctccattg 
aggtgaagca 
ggccgtcttt 
ctaatggacc 
caatagacaa 
ctttttccaa 
aaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2336 



<210> 56 

<211> 2200 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 1429651CB1 



<400> 56 

gagaacttta 

ctcgttcagg 

ggatgcgttt 

cctctttgtc 

aggctacttc 

cagcaatgtt 

ttctgtatca 

cctacccatc 

cggtggcatc 

caagatctcg 



cgcctggatc 
accatggaga 
ccccagaagg 
ctggctgttg 
ctggctggag 
ggaagtggac 
gcttatgaac 
tacattgccg 
agaatcccca 
gtagacatgt 



tcatctaact 
gcggcaccag 
gcttggagcc 
gactatggtc 
gggacatggt 
atttcattgg 
ttaatggctt 
gtcaggtcac 
ccatcctggc 
atgcaggtgc 



gacacagaaa 
cagccctcag 
tggggacatc 
cacagtgaag 
gtggtggcca 
cctggcaggg 
gttttctgtg 
cacgatgcca 
tgtactctac 
catcttcanc 



ccctgtaagg 
cctccacagt 
gcggtgctag 
accaaaagag 
gtgggtgcat 
tcaggtgctg 
ctgatgttgg 
gaatacctac 
ctatttatct 
cagcagtctt 



atccagaggt 60 
tagatcccct 120 
ttctgtactt 180 
acacagtgaa 240 
ccttgtttgc 300 
ctacgggcat 3 60 
cctggatctt 420 
ggaagcgctt 480 
acatcttcac 540 
tgcacctgga 600 
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tctgtacctg 
cctggctgct 
caccttgatg 
cttcttggcc 
tgccttccat 
tggaatgtcc 
tctggctgcc 
ggtgctgccc 
tcaagtggcc 
ttcggacatc 
gatgatggct 
cagcaccatc 
gctcatgatt 
ccctgtggtc 
ctacctgcag 
tgaaaagggt 
cctggacttt 
gaagagcatt 
ctccaccgtg 
gtttactcgt 
tcttaccctc 
gatggttcaa 
agtggtgaag 
ggccagagca 
ggacgtcaac 
gtgtggggtg 
aaagaaaaaa 



gccatagttg 
gtgatctaca 
ggctacagtt 
cuggctagca 
attttccgag 
atcccatccc 
aagaacctgt 
ctcttcataa 
tgtgcagatc 
gcgtatccca 
gtgatggtgg 
ttcaccatgg 
gtgggcaggg 
caggccagcc 
ccgcctgtgg 
gccttctggg 
atttacgtgc 
cactacctct 
agctggttca 
cacgaccccg 
tctcagaacg 
gaaaacacgt 
gccatcctgt 
gaagccatca 
ctcattttct 
aacccagggg 
taataaagct 



ggccactggc 
cggatgcccc 
tcgccgcggt 
accggagtga 
atccgctgac 
tctggtactg 
cccatgccaa 
tggtgttccc 
cagagatctg 
aactcgtgct 
cggctctcat 
acctctggaa 
tgtttgtgct 
agggcggcca 
cggtggtctt 
gcctgatctc 
agcctcgatg 
acttctccat 
cagagccacc 
tggtccagaa 
ggatgccaga 
ctaaaaccca 
ggctctgtgg 
tagtttccct 
gcgtgagctg 
tccaaactct 
tttgtttacc 



catcactgcc 
gcagacgctg 
tggtgggatg 
gaacagcagc 
atctgatctic 
gtgcacggat 
aggaggtgct 
cgggatggtc 
ccagaagatc 
ggaactcctg 
gtcctccc.tc 
tcacctccgg 
gctgctggtc 
gctcttcatc 
catcatggga 
gggcctgctc 
cgaccagcca 
gatcctgtcc 
ctccaaggag 
ggaacaagca 
ggccagcagc 
cagctgtgac 
aatacaggag 
ggaagaaaac 
cgccatcttt 
gtttctcttc 
aaaaaaaaaa 



gtatacacgg 
atcatgctta 
gaaggactga 
tgcgggctgc 
ccgtggccgg 
caggtgattg 
ctgatggctg 
agccgcatcc 
tgcagcaacc 
cccacagggc 
acctccatct 
cctcgggcat 
ctggtctcca 
tatatccagt 
tgtttctgga 
ctgggcttgg 
gatgagcgcc 
acggtcaccc 
atggtcagcc 
ccaccagcag 
agcagcagcg 
atgaccccaa 
aagggcaagg 
cccttggtga 
atctggggct 
agtgctccat 



ttgctggtgg 
taggagcgct 
aggagaagta 
cccgggaaga 
gggtcctatt 
tccagcggac 
catacctgaa 
tcttcccaga 
cctcaggctg 
tccgtgggct 
ttaacagtgc 
ctgagaagga 
tcctctggan 
ccatcagctc 
agaggaccaa 
ttaggctggt 
cggtcctggt 
tcatcactgt 
acctgacctg 
ctcccttgtc 
tccagttcga 
agcagtccaa 
aagagctccc 
agaccctcct 
attttgctta 
ttttttaatg 



660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2200 



<210> 57 

<211> 2823 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2069971CB1 



<400> 57 

gaaagacata 

taggtgaaaa 

caacaacact 

tagaaagtgt 

caacaacgac 

tatcttgtga 

aaatgaacac 

cccagagaga 

aagcccacag 

tcagcaatat 

cttgctgttg 

ctcttctccc 

cgagcagaga 

ctagtgtacc 

atgatcatcg 

cagagatgcc 

tttgtgcatg 

atctacttta 

ctggtcttgg 

gtgccaaaga 

ctgtctgtgc 

catgttgcta 

cttagcaaaa 

gaccccgact 

ccaaatttga 

aggaaccaaa 

tccattcacc 

cagtagctgt 

aacagctgtt 

agaacacata 



cacacttcat 
caatgaagcc 
gatgtaggaa 
ataaataatt 
aacaacagcc 
atgataaagc 
ttcatagcaa 
gaccagagga 
aaaagggggc 
gcttcatttt 
tcacagatgc 
tgtggttgtc 
tccttggtgc 
tggcatgtga 
tttccagctg 
ttggccacaa 
cccttggaga 
agccagagta 
ccagcaccat 
gcctgaatta 
acagcctgca 
cagcagccag 
gctttacgat 
gccttttctg 
caggccacct 
ggaagaaatt 
atgaaggaag 
gttcaattgc 
tgtaactatc 
ggtaaatttg 



gtaatgctac 
aggtaatatt 
gctcattatt 
gcagtgctgc 
gcagctcatc 
tgccaagatg 
tgtggaactc 
gctggagtca 
gaatgagtac 
catgattgca 
tgcccacctc 
atcgaagcct 
cctgctctcc 
gcgcctgctg 
cgcagtggcg 
tcacaaggaa 
tctatttcag 
taaaatagcc 
cactatctta 
cagtggtgtg 
catctggtct 
ccgggacagc 
gcactcactc 
tgaagacccc 
tcaaacatgc 
catgtcatgg 
aggcactgag 
aggaatgtgt 
ggcaatacca 
aactcaggaa 



ctgcaagtct 
gcaaggaggc 
ttaatttctg 
tttgcttcca 
ctggccgcca 
tatgctttca 
caacagaaac 
ggaggcatgt 
gcctatgcca 
gaggtcgtgg 
ttaattgacc 
ccctctaagc 
atcctgtgca 
tatcctgatt 
gccaacattg 
gtacaagcca 
agtatcagtg 
gacccaatct 
aaggacttct 
aaagagctta 
ctaacaatga 
caagtggttc 
accattcaga 
tgtgactagc 
tgctatgcag 
tgcaatgcac 
atccatcaat 
atatagatta 
aattcatctc 
agtcttacca 



ccctagaaaa 
tgtaatttta 
gagcctttta 
aaactgggca 
tggagtttct 
cactagaaag 
cggtgaataa 
accactgcca 
agtggaaact 
gtgggcacat 
tgaccagttt 
ggctgacatt 
tctgggtggt 
accagatcca 
tactaactgt 
atgccagcgt 
tgctaattag 
gcacattcat 
ccatcttact 
ttttagcagt 
atcaagtaat 
ggagagaaat 
tggaatctcc 
tcagtcacac 
tttctgcatc 
attttatcta 
caattggatt 
ttcctgagtg 
ccttccaata 
gaaatcagtg 



gcagtttttg 
gcagacctac 
attttttctt 
gtgagttcaa 
tgaaagaacg 
aaggagctgc 
agatcagtgt 
cagtggctcc 
ctgttctgct 
tgctgggagt 
cctgctcagt 
tggatggcac 
gactggcgtg 
ggcgactgtg 
ggttttgcac 
cagagctgct 
tgcacttatt 
cttttccatc 
catggaaggt 
cgacggggtg 
tctctcagct 
tgctaaagcc 
agttgaccag 
cgtcagtttc 
atagaaaata 
tttatttagt 
atatactgat 
gagccgaagt 
atgcatcttg 
gaagggacaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 
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atagtcacaa 
taaaagtgac 
gtgatttttt 
actaattttt 
cttgtttatt 
agtgcttaaa 
aacagccgac 
atttacctgc 
gtgagcactg 
cgcagcaagt 
agatgtgcag 
cttctgcatt 
ttgtcaacca 
agtatcatga 
tgggcatggc 
catatggagg 
aggcacaagg 
teg 



aattttacca 
tetgtatget 
tttctactct 
atttacttgt 
tcaaaacttt 
cactggcacc 
aaaaacattc 
aggaaggaat 
ctttgtgcta 
gcttctgtgt 
gccaacattc 
tttaaaagtt 
caggagttca 
attgeaatga 
cagtgatgtt 
ctctccagga 
cacacttatg 



aaacattaga 
aacgccacat 
gaattggaaa 
cacattttgg 
atataatcac 
agecaaagaa 
gagttgaccc 
aaagcagatg 
gacattgggc 
tctggtagca 
tggaaatcct 
acccagagat 
ttgagtggga 
tgtagtgggg 
teaegtcatt 
agacgaagaa 
tttgtctgtt 



aacaaaaaat 
tagaacttgg 
tatgtatgaa 
caataaatcc 
tgttcaaaag 
tgtggttgta 
caccaagttg 
caaccaattc 
ttagcattga 
actcaacact 
atgtcactgg 
gcxtctaaag 
cagctagata 
tataaaagga 
gaggtgacag 
gagaaggaca 
agcttttagt 



aaggagagee 
ttctctcacc 
tatacagaga 
ctcttatttc 
gaaatatttt 
gagacccaga 
ttgecacaga 
attcagtcca 
aactataaag 
atctgtggag 
gtttggtttg 
atgagecata 
catacattgg 
aagcgatgga 
ctctgctgga 
ttctaggcaa 
tgaaaaagca 



aagtcaggaa 
aagctgtaat 
agtgcttaca 
taaattctaa 
cacctaccag 
agtcttcaag 
taacttagat 
cgagcatgat 
aggaatcaga 
agtaaactga 
gaaacctgga 
gtctagaaga 
cagctacaat 
tattgeegga 
ctttgaatta 
aaagaagact 
agatacaggg 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2823 



<210> 58 

<211> 1491 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No; 2329339CB1 



<400> 58 

cgcctccctc 

gggatcttgt 

ctgcacagcc 

aggctctget 

tggtggtgga 

acttcagagg 

acggtgtgtc 

agctcagcct 

aaattatcca 

atgtctaccg 

atatcctcaa 

agaaggtcta 

atgagggggc 

tgcacgagcc 

tgeagegget 

tcatgtacca 

ggceggaett 

tcagcgacca 

ccctggcccc 

ttggagggga 

cacggcacaa 

tgtgtgttgt 

atgctgtgga 

gggaggtgta 

gcaagcttat 



cagctgegag 
ggctgtcagg 
ctcacaccca 
ccgagggctg 
gggcagacac 
aatgtttgct 
etacaatget 
gagcaatgta 
tttctgctgt 
gctggcagag 
aaactttgtg 
ctccctcctc 
ccttctctac 
cccaaagctc 
gcatgacaag 
ceggaacgag 
ccagtgcgtt 
ggecaagtat 
ccgcatgtcc 
caacaatgtc 
ccgctggttc 
aggcaggtac 
gcgctacgac 
tgcccacgca 
tccctttagt 



tgcggcctcg 
atggcagagg 
cactgcgtga 
ctggctctcc 
atcgaggccc 

gggggattga 

atgtgccaaa 
caagagacac 
gatttcctca 
ctgtttgact 
gccttctctc 
agcagcaatc 
cattatagee 
cttgagacag 
ctggacccca 
agcctacagc 
gtgggcttcg 
ctaaacccct 
aaccagggca 
caaggatttc 
cagatccagt 
atetaegctg 
cctgccacca 
ggcgcgacgc 
gagggttaat 



gctggcggcg 
agcaggagtt 
acaacaccta 
gggacagegg 
atcgcatcct 
aggagatgga 
tcctacattt 
tggtggctgc 
tgtcctgggt 
tgagccgcct 
ggactgacaa 
gcctggaggt 
tggagcaggt 
tgcggtttcc 
gecctttgag 
ccagcctgca 
ggggcattca 
tactgggaga 
tegeggtget 
gagcagagtc 
ccctgcagca 
tggcgggccg 
actcctgggc 
tggaggggaa 
tttagcttgc 



gcaccaggcc 
cacccagctc 
ccgcagcgca 
aatcctcttc 
gctggctgcg 
acaggaagag 
catatacacc 
ctgccagctt 
ggacgaagag 
gactgagcaa 
gtaccgccag 
ctcctgcgag 
gcaggctgac 
gctgatggaa 
ggacacagtg 
gagcccgcaa 
ctccacgccg 
gtggaagcac 
caacaacttc 
ccgatgctgg 
ggagcacgcc 
tgactaccac 
atacgtggcc 
gatgtatatc 
actggccgtc 



acagttgtaa 
tgcaagttgc 
cagcactccc 
gatgttgtgc 
tectgegatt 
gtcctgatcc 
tccgagctgg 
cagatcccag 
aacattctcg 
ctggacacct 
cttccattgg 
accgaggtat 
cagatctcgc 
gctgaggtcc 
gccagcggcc 
aeggagctge 
tccactgtcc 
ttcactgcct 
gtatacttga 
aggtatgacc 
gacctgtccg 
aatgacctga 
ccactcaaga 
acctgcggac 

g 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1491 



<210> 59 

<211> 986 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2540219CB1 

<400> 59 

ggacgccaac ctcggtttga agtccagggc agtggctcct gcggacagcc agcataccag 60 

gggccagtgc actgeattae aaccattgtg aggaatgagg gcctgtgccg tgtggctggc 120 

gggcggcatg gcaggagcaa tttcttgggg gaeagegact cctatggatg tcgtgaaaag 180 



44/60 



WO 00/78953 



PCT/US00/16668 



tcgactccaa gctgatgggg tttatttaaa 
ccagagttac cagaaggaag gtcttaaagt 
gcggggcttc cccatgagtg cggccatgtt 
ccgcggggac cacgcagtga cgagcccata 
agtgttcccc tgggcctcat ctctgcatgt 
ttggacctcc aagtggacat caattagcaa 
tggcccttct gatgcctggg atgcctcatg 
atccccaatg cccactctgc taggctggca 
agctgagaag tcctgaccat gagcaccagg 
tcactcctct gtctgaggat ggggaggggc 
agcctcaagt aacacgtccc cgtgcctcca 
ccgctcctgc acccgtggat cctgaggaca 
tccagtgcgt gctcctcact gtgcacttat 
tcaggtctgc agcaaataaa tgaaag 

<210> 60 

<211> 4023 

<212> DNA 

<213> Homo sapiens 



caaatataaa ggtgtcctgg actgtatctc 240 
gtttttcaga ggcatcactg tgaacgcggt 300 
ccttgggtac gagctgtcgc tgcaggctat 360 
agcgccagga ggtgaacaca ggatgactac 420 
gaagccccga gagctgcaga tgtttgccct 480 
gcgtgggcta ggatggtgca gacactgacg 540 
agtcactgat tcaagccctc caaggttctg 600 
tcaaagagct ttccaagaaa tgtttggtcc 660 
gagccagaaa ccacccagag aaacgttgct 720 
cagtgagctc tgggcccagc cactccctcc 780 
gtctcctctc agcaccgacc aggtttttcc 840 
gcggtagcgc cttcctcacc gcacgctgag 900 
tagtgtctgt tgagtgatta aatcacatcc 960 

986 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 2722462CB1 

<400> 60 

gtccggggcg gcgcgtatgg cggcactatg ccatccacga ctcccaggcc cccagtctca 60 
gctctggggg tgagagttcc ccctccagcc ccgcacacaa ctgggagatg aattaccaag 120 
aggcagcaat ctacctccag gaaggcgaga acaacgacaa gttcttcacc caccccaagg 180 
atgccaaggc gctggcggcc tacctctttg cacacaatca cctcttctac ctgatggagc 240 
tggccacggc cctgctgctg ctgctgctct ccctgtgcga ggcccccgcc gtccccgcac 300 
tccggcttgg catctatgtc cacgccaccc tggagctgtt tgccctgatg gtggtagtgt 3 60 
ttgaactctg catgaagtta cgctggctgg gcctccacac cttcatccgg cacaagcgga 420 
ccatggtcaa gacctcggtg ctggtggtgc agtttgtcga ggccatcgtg gtgttggtac 480 
ggcagatgtc ccatgtgcgg gtgacccgag cactgcgctg cattttcctg gtggactgtc 540 
ggtattgcgg tggcgtccgg cgcaacctgc ggcagatctt ccagtccctg ccgcccttca 600 
tggacatcct cctgctgctg ctgttcttca tgatcatctt tgccatcctc ggtttctact 660 
tgttctcccc taacccttca gacccctact tcagcaccct ggagaacagc atcgtcagtc 720 
tgtttgtcct tctgaccaca gccaatttcc cagatgtgat gatgccctcc tactcccgga 780 
acccctggtc ctgcgtcttc ttcatcgtgt acctctccat cgagctgtat ttcatcatga 840 
acctgcttct ggctgtggtg ttcgacacct tcaatgacat tgagaaacgc aagttcaagt 900 
ctttgctact gcacaagcga accgctatcc agcatgccta ccgcctgctc atcagccaaa 960 
ggaggcctgc cggcatctcc tacaggcagt ttgaaggcct catgcgcttc tacaagcccc 1020 
ggatgagtgc cagggagcgc tatcttacct tcaaggccct gaatcagaac aacacacccc 1080 
tgctcagcct aaaggacttt tacgatatct acgaagttgc tgctttgaag tggaaggcca 1140 
agaaaaacag agagcactgg tttgatgagc ttcccaggac ggcgctcctc atcttcaaag 1200 
gtattaatat ccttgtgaag tccaaggcct tccagtattt catgtacttg gtggtggcag 1260 
tcaacggggt ctggatcctc gtggagacat ttatgctgaa aggtgggaac ttcttctcca 1320 
agcacgtgcc ctggagttac ctcgtctttc taactatcta tggggtggag ctgttcctga 1380 
aggttgccgg cctgggccct gtggagtact tgtcttccgg atggaacttg tttgacttct 1440 
ccgtgacagt gttcgccttc ctgggactgc tggcgctggc cctcaacatg gagcccttct 1500 
atttcatcgt ggtcctgcgc cccctccagc tgctgaggtt gtttaagttg aaggagcgct 1560 
accgcaacgt gctggacacc atgttcgagc tgctgccccg gatggccagc ctgggcctca 1620 
ccctgctcat cttttactac tccttcgcca tcgtgggcat ggagttcttc tgcgggatcg 1680 
tcttccccaa ctgctgcaac acgagtacag tggcagatgc ctaccgctgg cgcaaccaca 1740 
ccgtgggcaa caggaccgtg gtggaggaag gctactatta tctcaataat tttgacaaca 1800 
tcctcaacag ctttgtgacc ctgtttgagc tcacagttgt caacaactgg tacatcatca 1860 
tggaaggcgt cacctctcag acctcccact ggagccgcct ctacttcatg accttttaca 1920 
ttgtgaccat ggtggtgatg acgatcattg tcgcctttat cctcgaggcc ttcgtcttcc 1980 
gaatgaacta cagccgcaag aaccaggact cggaagttga tggtggcatc acccttgaga 2040 
aggaaatctc caaagaagag ctggttgccg tcctggagct ctaccgggag gcacgggggg 2100 
cctcctcgga tgtcaccagg ctgctggaga ccctctccca gatggagaga taccagcaac 2160 
attccatggt gtttctggga cggcgatcaa ggaccaagag cgacctgagc ctgaagatgt 2220 
accaggagga gatccaggag tggtatgagg agcatgccag ggagcaagag cagcagcgac 2280 
aactcagcag cagtgcagcc cccgccgccc agcagccccc aggcagccgc cagcgctccc 2340 
agaccgttac ctagcccagc gcccgaaagc cgtctcttct atgcaataac acaatagtat 2400 
tactctactg cgatgtacgg aactgcggtg tgtgtacaca tactcacgta tatgcacata 2460 
tttatataca ggaagaaaaa agacagacaa gatggggctt ggtttataac caccttgccc 2520 
tgtcttcctt aactccagaa gccagtttgg tgaggggtgg gggtgcggcc accaggtctg 2580 
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agctcttcct 
ccagtcgact 
tgtcactgct 
ccctgcggcc 
gaaagaactc 
tcttcgggga 
ggccagagct 
ttggctctgc 
tccacagtgg 
ggccgatgtc 
ttgcgattta 
gataccaggt 
ccctgtggga 
tgctcacacc 
ctcagttaac 
tctgagcgtg 
tcagtgcaga 
ttctgcagca 
gggcattcct 
tggacagagc 
ccgaccccag 
aagctgggtc 
gagttggaca 
tattgagact 
ggg 



actgtggaag 
gcggggcttg 
actgcttcag 
catgccacag 
ccggtctcca 
ccagcgccca 
cagttagcca 
aagcctccca 
cctccttggg 
tgcaacccag 
cccgttcaag 
ccttcacacg 
cgcagcagcc 
agagatttcc 
gtcgccaccc 
gtggaggtag 
ggtcgctttg 
ggagccgcag 
ggccagcccc 
ctccctctgg 
gggagcagta 
ccatagccag 
agtagcaggt 
cactgtgatt 



gctccagaag 
cccctcacgt 
gctcacaccc 
gtttctgtgt 
gggtggtatt 
ggatctaatg 
gtgccgggcg 
cccagccttc 
gacccagaac 
accacttcgt 
caaaacaacg 
tgtgcactag 
ccgtggcccc 
acagcaggag 
ctcctcccac 
gagggttgcc 
gttccgcttc 
tggcaggatg 
ggccctctgg 
ccttgtcctc 
cctggtcccc 
cacggcatgg 
gtttgttttt 
cccccgggag 



gcccttcaca 
gggccggcct 
ccccgacctg 
tctgctttag 
tcagtgtctg 
gaagcggaat 
gccacagatt 
tctggcttaa 
ccggaggaag 
ggaatgggct 
ttttggttaa 
gaacaggagc 
ggcccagttc 
cggttggtgc 
catggtaccc 
agctgcagtg 
cctgggccac 
gagggtgcga 
tgcctgcttc 
ttcccagcca 
caccccctcc 
ttctcccctt 
aaagcacagc 
tcagactggc 



aggagacccc 
ccatcggcca 
atggcgtgcc 
ggacagaacc 
tgacaatgtc 
tggggcaact 
acactgacca 
cccttgttgg 
gggcatgagg 
cttgaccaaa 
ctaaggattg 
gaacagcaca 
ccagccaccc 
agaagtaggt 
tgtaggagcc 
accctgccac 
agaacggaac 
agggcaagga 
ctgtgacttc 
cagaacgggc 
tcccaaccac 
ccccccttcc 
cctttgggaa 
tttgtctctt 



tcacctggat 
cgtccaaagc 
cgccccctct 
acttaggaaa 
acgcaacacc 
gggcccgtgt 
atctcctccc 
cgaaaactct 
caggaagcgg 
tccctttttt 
tgctaaagcc 
gagagacgct 
tccctggctc 
tcagatgaac 
ctgtatgaca 
agaggcaggg 
acagcatagg 
gtgcactgct 
agaaggcagg 
agggtggcac 
ctccaaggcc 
caggtcaggg 
agcaacacat 
tctctctgga 



2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4023 



<210> 61 

<211> 2345 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2739264CB1 



<400> 61 

gaaaggaggc ggaactcggt gatctgactg gcggtttccc ggccggactg agaaggggag 60 
cgcgctgcgc gtcgcaggag taacctactt ggtctcctgc tttcgcgaca tggccttcaa 120 
ttttggggct ccctcgggca cctccggtac cgctgcagcc accgcggccc ccgcgggtgg 180 
gtttggagga tttgggacaa catctacaac tgcaggttct gcattcagct tttctgcccc 240 
aactaacaca ggcactactg gactctttgg tggtactcag aacaaaggtt ttggatttgg 300 
tactggtttt ggcacaacaa cgggaactag tactggttta ggtactggtc tgggaactgg 360 
actgggattt ggaggattta atacacagca gcagcagcaa actacattag gtggtctctt 420 
cagtcagcct acacaagctc ctacccagtc caaccagctg ataaatactg cgagtgctct 4 80 
ttctgctcca acgctgttgg gagatgagag agatgctatt ttggcaaaat ggaatcaacc 540 
gcaggccttt tggggaacag gaaaagggta tttcaacaat aatattccgc cagtggaatt 600 
cacacaagaa aatccctttt gccgatttaa ggcagtaggt tatagttgca tgcccagtaa 660 
taaagatgaa gatgggctag tggttttagt tttcaacaaa aaagaaacag agattcgaag 720 
ccaacaacaa cagttggtag aatcattgca taaagttttg ggaggaaacc agacccttac 780 
tgtaaatgta gagggcacta aaacattgcc agatgatcag acagaagttg ttatttatgt 840 
tgttgagcgt tcgccaaatg gtacttcaag aagagttcca gctacaacgc tatatgccca 900 
ttttgaacaa gccaatataa aaacacaatt gcagcaactt ggtgtaaccc tttctatgac 960 
tagaacagaa ctttctcctg cacagatcaa acagctttta cagaatcctc ctgctggtgt 1020 
tgatcctatt atctgggaac aggccaaggt agataaccct gattctgaaa agttaattcc 1080 
tgtaccaatg gtgggtttta aggaacttct ccgaagactg aaggttcaag atcagatgac 1140 
taagcagcat caaaccagat tagatatcat atctgaagat attagtgagc tacaaaagaa 1200 
tcaaactaca tctgtagcca aaattgcaca atacaagagg aaactcatgg atctttccca 1260 
tagaacttta caggtcctaa tcaaacagga aattcaaagg aagagtggtt atgccattca 1320 
ggctgatgaa gagcagttgc gagttcagct ggatacgatt cagggtgaac taaatgcacc 1380 
tactcagttc aagggccgac taaatgaatt gatgtctcaa atcaggatgc agaatcattt 1440 
tggagcagtc agatctgaag aaaggtatta catagatgca gatctgttac gagaaatcaa 1500 
gcagcatttg aaacaacaac aggaaggcct tagccatttg attagcatca ttaaagacga 1560 
tctagaagat ataaagctgg tcgaacatgg attgaatgaa accatccaca tcagaggtgg 1620 
tgtctttagt tgacagttca caaacttgtg taaaggtttg tgaaatgcat cttcttactg 1680 
catcagacct tccttaagaa tgaaaccgac cacatggagg gaaaaagaaa acaattcttt 1740 
cttggattgg ttttttgaga agtttactga caaattactg ttcatcaaat ctgaaatagt 1800 
cacctcacag ctcttcaaag aaaacctttg aaagatttat atctaaaagc tgtatttact 1860 
ttaaaagaag tgcataatta ccaaaattgt atgtactatt gtacattttt, acaacagcat 1920 
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tttcttaaac 
ataccaagta 
gctgttggag 
ctgtgggagg 
tatgaaaatg 
attaggatat 
tatttctaaa 
aaaaa 



ataatctgtg 
aatatgaaat 
atttgtactt 
agtaaagaaa 
tatttttata 
aacaaaacta 
aagatgaaaa 



tttaatgatt 
aatctacttt 
gagtataaat 
atccaaaagc 
ttattttact 
cttaaccatt 
cttataaaca 



attgtccatt 
gcacataaca 
gtacaccagt 
atttaatgtt 
tatttggaat 
tttgcaacca 
aattcttaat 



gagcctgtac 
gaacaaacta 
ttttgtattt 
ttgtttttgt 
ttacagagca 
ttttgttttt 
ttgtaattac 



tctgccttcc 
taattacttg 
gtgaactcat 
tctataaaga 
cacctaagca 
taagcctttt 
ttttaaaaaa 



1980 
2040 
2100 
2160 
2220 
2280 
2340 
2345 



<210> 62 
<211> 2085 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> mis cofeature 
<223> Incyte ID No: 



2758310CB1 



<400> 62 

cggagatgtg gcgaccgttt ctggcatcat 
gcggccgggc ctgtctgtgg gagctgcatc 
cacgatcgga catgcatgtg gttctgcggc 
gatacctgtg tgcagctctg tcatttccac 
tggccgtccc ctgctgtgtt cttcaggaga 
agccccacac ccctgtggag gaaggtgctc 
ctcggagccc gctatgtcac ggcagaggca 
gatggcatgg ggcgctttgg caggtctctg 
tggtggtgca ccaatgttgt ccttcgaggg 
tctgcgtgtc accagcgggc ggctgatgcc 
ctctacgtga agctgggcca ggggctgtgc 
acccggaccc tgcgcgtgct agaggacagg 
gagttgttcc ttgaggactt ccaggccctc 
cagccaattg ctgccgccag cctggcacag 
agcgtggctg tgaaggtgca gtacatcgac 
accctggagc tcctgctgcg gctcgttgag 
gtcctccagg acctgaaggg gaccctggcc 
aacgcagagc gctgtgcgcg ggagctggcg 
cactgggaca agtccagcaa gcgcgtgctc 
aacgatgtgg aggccatcag gagccagggg 
atcaaggcct ttgctgagca gatattttac 
ggcaacgttc tggtgcggaa aggcccggac 
gggctctacc agttcctgga ggagaaggac 
atcatcctgc gggacgacgc cgccatgagg 
tacctcctgt tcgccgagat gctcatgcag 
tcgcacctac tgagccgcga agaggcggcc 
gaggccgtca tggcggtgct cagggagctg 
atcaacaccg tgcgcgctat caacgtggcc 
atggctaaaa gggctgtccg gggctggagc 
tacggcacca gcctcctgcg ccacgccaag 
gcgctcaggc tggagacctt ggccatgcgg 
cacctgagcc tcgtgccccc agcggaggag 
gccgcccagg gccggcgggg cccttttcac 
gtgtagacac cccgagcccc gtgggcactc 
aggaggccgt gtaatgacca cacactcctc 

<210> 63 

<211> 3014 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2762348CB1 



tctgagactc ggcagttgct tctcactgct 60 
ctcctcatct gcaggcgctg gaaaaccaga 120 
caaagcacgc cctttggttg tgaacttcat 180 
tctgctctgc tgcacagaag gcagaagccc 2 40 
aacgtcaggg gccttcctcc aaggttctcc 300 
tccaccgcgg tagtgggggc gcccctgctc 360 
cgggagaaga ggaggatgcg gctcgtggtg 420 
aaggtcggcc tgcagatctc cctggactac 480 
tggaagagcc caggctactt ggaggtgatg 540 
ctggtggcag gggccatcag caacgggggc 600 
tccttcaacc acctgcttcc ccccgagtat 660 
gccctcaagc ggggcttcca ggaggtggat 720 
ccccacgagc tcttccagga gtttgactac 780 
gtgcacagag ccaagctgca cgatggcacc 840 
ctgcgggacc gctttgatgg ggacatccac 900 
gtcatgcacc ccagctttgg cttcagctgg 960 
caggagctgg acttcgagaa tgagggccgc 1020 
cacttcccct acgtcgtggt gccccgcgtg 1080 
actgccgact tctgcgccgg ctgcaaggtc 1140 
ctggcagtgc atgacatagc agaaaagctc 1200 
accggcttca tccactcgga cccacatcct 1260 
gggaaagcgg agctggtgct gctggaccac 1320 
cgcgcagccc tctgccagct gtggcgggcc 1380 
gcgcacgcag ccgcactggg ggtgcaagac 1440 
cgccccgtgc gcctggggca gctgtggggc 1500 
tacatggtgg acatggcccg cgagcgcttc 1560 
ccgcggccca tgctgctggt gctgcgcaac 1620 
ctcggcgccc ccgtggaccg ctacttcctt 1680 
cgcctggcgg gcgccacgta tcggggtgtc 1740 
gtcgtctggg agatgctcaa gtttgaagtg 1800 
ctgaccgccc tcctggctcg tgctctggtc 1860 
ctctaccagt acctggagac ctagggtgca 1920 
cttgggctga cggaggtggc ggggctagag 1980 
gcactggggg gctgtgacag cagctgggcc 2040 
tcaagcaaaa aaaaa 2 085 



<400> 63 

ggaggcaaag gccaggctga ggatcagggt 

gctggaggct gtgctgagtg cccggtgcag 

gcctgctgca gaccgaggcx gggccaagat 



ggcccgggtg gcagcgggga ggcgctgcat 60 
gtgagccggt cctgcggagt tgtgccgagt 120 
ggcgtctgtg tttcgaagcg aggagatgtg 180 
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tttgccacaa 

gctcggattg 

tgtgaatgaa 

gacgcaaaat 

ggaaatgatt 

ccagaaccag 

gaagaaaacc 

ggacacttct 

gttcatagcc 

gcgagtctgc 

tcctgtgacg 

gctcaggcag 

agagcctgcg 

aatcaccgtc 

caactggcac 

catgtgcaac 

agatgccaca 

ggcccccatc 

taaattcaca 

gataaaccca 

agactgtggt 

acgcttgctc 

tctgatccta 

ctccaagtcc 

cacatggaat 

aggagtgtat 

aaacaaactc 

ccagatggtt 

caacatcatt 

ggttttcatg 

cagcatcctc 

acccctctac 

tgtgccgtgg 

gctgcaggca 

cccttctagc 

tggagaagag 

ctgcctgggc 

tcatgcacaa 

aggctgggga 

agccatcctt 

ggttgagttc 

taaacacatc 

tcaccatgcc 

aatgggaagg 

cctgtggatt 

ttcatgtggg 

aacagatgtc 

ttcaaaaaaa 



ctgtttctcc 
gttcagttca 
gtcagaaggt 
gagattgtag 
accctggaga 
caggccttga 
caagacttct 
ggcctcctgg 
gggtgtgatc 
cgaggaaacg 
aaagaagaaa 
aaaatcaaga 
gtggagcgca 
anaacacaaa 
tcctggctca 
atcgacgtca 
cgtatcaaga 
atgaccacag 
gctggcttcc 
gccccctaca 
catggaaccg 
tcccagaaga 
cttatgggca 
ttgaacatct 
actcatgtaa 
tttggaaatc 
acatttctga 
tttggtgtca 
ctgcaattta 
atcattttca 
atccacttca 
aaacatcagc 
atgcttctga 
tccaggatcc 
cgttctggcc 
ttcaactttg 
tgcatttcaa 
ctgtctgaag 
ggaatcgtcg 
ctgatcatgg 
cagaacaagt 
ctggatggca 
aatgaaggaa 
ttgttcttgg 
tgatacgact 
gtattaattc 
ttcttgcaat 
aaaa 



aggtggaagc 
aagatttaaa 
gtgaatcact 
ttcagttgct 
ctgttctaga 
aacaaagctt 
ttgagacgga 
agttgaaagc 
caacagggaa 
tgtacttgaa 
ttcagaagca 
agatctgtga 
gagagatgtt 
cagagtctca 
tcaaggtgca 
cccagcagtg 
gggcactgga 
tgcaatctaa 
agaatattgt 
ccatcatcac 
tgatgctcct 
cagacaatga 
tcttctccat 
ttggctcttc 
tggaggaaag 
catacccgtt 
actcgtataa 
tcctcagcct 
tccctgagat 
aatggtgctg 
tcaacatgtt 
aagaagtcca 
ttaagccgtt 
aagaagatgc 
agaggacttc 
gagacgtctt 
acacagcctc 
tgctctggac 
gggtttttat 

agggcctctc 
tctatgtcgg 
cagccgagga 
gttcagtctt 
ctcacctgaa 
taaccacgtc 
tcaaataata 
attttaaata 



tgcatattgc 
tatgaatgtg 
ggagagaatc 
cgagaaaagc 
aaaactggaa 
cctagaactg 
aaccaattta 
agtgcctgca 
gaggatggct 
gttcagtgag 
catattcatc 
tgggtttcga 
ggagagcgtc 
ccgccagcgc 
gaagatgaaa 
tgtcatcgcc 
gcaaggcatg 
aacagcccct 
tgatgcctat 
tttccccttc 
ggctgcactt 
gatttggaac 
ctacacgggt 
ttggagtgtc 
tctatatctg 
tgggatcgat 
aatgaagatg 
tttcaatcac 
gatttttatc 
ctttgacgtc 
tctgtttaac 
aagtttcttt 
tattcttaga 
cactgagaac 
tgcagatacc 
tgtccaccaa 
ctacctgcgg 
tatggtgatg 
tatttttgcc 
tgctttcctg 
ggatggttac 
gtaggctgag 
gtctttgata 
gcatgaaacc 
agaggaagga 
aagtaattga 
ttgtatttga 



tgtgtggctg 
aacagctttc 
ctccgttttc 
ccactgaccc 
ggagagttac 
acagaactga 
gctgatgatt 
tatatgaccg 
tcctttgagc 
atggacgccc 
atattttacc 
gccactgtct 
aatgtgaggc 
ctgctgcagg 
gctgtctacc 
gagatctggt 
gaactaagtg 
cccacattta 
ggtgtcggca 
ctgttcgctg 
tggatgattc 
accttcttcc 
ttgatctaca 
caacccatgt 
cagctggacc 
ccgatttgga 
tcggtgatcc 
atatacttca 
ctgtgtctgt 
cacgtatctc 
tacagtgact 
gtggttatgg 
gccagtcatc 
attgaaggtg 
cacggggctc 
gccatccaca 
ctctgggccc 
aacagcggcc 
gtatttgctg 
cacgccctgc 
aagttttctc 
ggctgcacct 
tcagcccctg 
gtgtattatt 
ctttggcaag 
caaatgaggg 
gaaaataaac 



agctcggaga 

aaaggaaatt 

tggaagacga 

cgctcccacg 

aggaagccaa 

aatacctcct 

tctttactga 

gaaagttggg 

ggttactgtg 

ctctggagga 

aaggagagca 

acccttgccc 

tggaagattt 

aagccgctgc 

acatcctgaa 

tcccggtggc 

gctcctccat 

acaggaccaa 

gctaccggga 

tgatgtttgg 

tgaatgagag 

acgggcgcta 

atgactgctt 

tcagaaacgg 

cagccatacc 

acttggcttc 

tgggaattgt 

gaagaactct 

ttggatacct 

agcacgcccc 

cttccaacgc 

ctttgatttc 

ggaaatccca 

atagctccag 

tggacgacca 

ccatcgagta 

tcagcctggc 

ttcagacgcg 

tcctgacagt 

gactgcactg 

cattctcctt 

cccacggtgg 

caaggcgctc 

tggacgtcag 

tgatattgtc 

gagaatgcta 

atctgagtca 



<210> 64 

<211> 1726 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3014 



3715961CB1 



<400> 64 

tgaaaatgcc 

cacatagagc 

tttctaatca 

gagtcctgaa 

atcagacagt 

caagaagccc 

gactaggcca 

agccaaagtt 

acatatgaac 



tgctgcgtac 
ctgcctggag 
atatgataaa 
agacaaggag 
gtgtgggaag 
acgtgttaga 
tgtcggaatg 
acctttttaa 
atctccgaga 



caaggtatgt 
aagctcatgg 
aaggtcagag 
atgttagtaa 
ggcagagtct 
agcaccggcc 
tcctcttatt 
tgatcttttt 
ggttgtgttt 



actagggcat 
tctgatggaa 
agcagtttct 
agcagagaag 
gaaaagagca 
ccatgagaca 
ttatacatac 
tacccagtgt 
ggcaaaagtg 



ctggggtaag 
agataagcaa 
gaaaaacatg 
ggagaattca 
tgccccattt 
aagacacagc 
ataagcatat 
attctggagg 
aagatgagca 



taaaaacaaa 60 
gaagagttaa 120 
tttttgagtt 180 
ttctagaaag 240 
ggagaagcat 300 
tagagagatt 360 
agatacatat 420 
tcgaatggtc 480 
ttatcccctt 540 
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tggaaatcag 
actgacctag 
ttgcgatttc 
gggctttggg 
ttaaccactt 
atcatgactc 
ccagccgatg 
cttttgtata 
agtctatata 
tggcttactt 
gatgcaaccc 
accccctatt 
gggcagcact 
aagtgatctg 
gaagaatcga 
ttctatgctg 
tgaagtagac 
ctgttatgtc 
gtacatactg 
cctagtgttg 



tcattggagg 
tgaaggttca 
gtggtgtaca 
caggctgggt 
atgatacagt 
acggtttatc 
tcatcaaaag 
aatcatcgac 
aaggcttttt 
atgaaaaaat 
ttaaagatac 
atttctacct 
ttattttttt 
gtcggatctc 
agcctgacca 
aagagcctgc 
gataggaatg 
agtatttatc 
taaattaaag 
aaggaaggtg 



gatgatggct 
gatgcaaatg 
tcatgcattt 
acccaatata 
gaaacactac 
aagtttatgt 
cagaataatg 
tgactgcttg 
accatcttgg 
cagagagatg 
agtgttcagt 
ctttaggaag 
ctggaaaccc 
acaaggccat 
ctttcacctt 
ttagaggagg 
tggaagaaca 
aatgaagttt 
ggaggtgaat 
tactttttct 



ggtgttattg 
gaaggaaaaa 
gcaaaaatcc 
caaagagcag 
ttggtattga 
tctggactgg 
aatcaaccac 
actcaggctg 
ctgagaatga 
agcggagtca 
attattgaaa 
acacctattc 
aagttctctt 
ccaatgagac 
gggcaagaag 
agtaccagga 
catacatagt 
gataattcac 
ggaaattaat 
tgtcagaaag 



gccagttttt 
ggaaactgga 
tagctgaagg 
cactggtgaa 
atacaccacc 
tagcttctac 
gagataaaca 
ttcaaggtga 
ccccttggtc 
gtccatttta 
tatgggcatc 
cacagagact 
tgactcctct 
cccgcacagc 
gtttggcctt 
gggagccagc 
gcttaagaaa 
ttttctgtca 
gaataaacat 
ataaaa 



agccaatcca 
aggaaaacca 
aggaatacga 
tatgggagat 
tgaggacaat 
tctgggaaca 
aggaagggga 
aggattcatg 
aatggtgttc 
aacccctaaa 
tgcaacacat 
gatttatagg 
ttttgtccaa 
attttctaaa 
tgagttgcta 
atttcagatc 
tacatttaac 
ttgttaaagc 
tttgagtttc 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1726 



<210> 65 

<211> 899 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



5108194CB1 



<400> 65 

gcggcggcga cgcgtccggg ccggtgaggg 
tcagcgcccg cgcctgtcgg gctgaactga 
ctggtatggg taatggtgtg aaggaaggcc 
tcctgtcctc gtccgtctca tcaaagcgtg 
ctggggccct ggctggtgcc cttgccaaaa 
tcatcttcca agtgtcttca aaaagatttt 
acacctacct caacgaggga tttctcagct 
gcgtggtgcc ctacgccgcc atccagttca 
gcagctacta tggcttccgt ggagaagccc 
cactggctgg aacgacagcc gcttcactga 
tggccgtaac cccgaaggaa atgtacagca 
gagaagaggg gctgaagact ctctaccatg 
cctacgctgg cctgagcttc ttcacctatg 
gcggccgcaa gcttactccc tttagtgagg 
ttacaacgtc gtgactggga aaaccctggc 

<210> 66 

<211> 643 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 5503122CB1 



ggcgcggggg gcgccggggg ggcccaagcg 60 
ggaccgagtc tcctgccatt ccgagcaggc 120 
cggtgcgatt gcatgaggat gctgaggctg 180 
accacaggca agtgctcagc tccctgctgt 240 
cagcggtagc tcccctggac cgaaccaaaa 300 
ctgccaagga ggccttccgg gtcctctact 3 60 
tgtggcgcgg gaactcggcc accatggtgc 420 
gcgcacacga ggagtacaag cgcatcctgg 480 
tgcccccttg gcctcgcctc ttcgccggcg 540 
cctaccccct ggacctggtc agagcgcgga 600 
acatctttca tgtcttcatc cgcatctcga 660 
gatttatgcc caccgtgctg ggggtcattc 720 
agacgctcaa gagcttgcac agagagtaca 780 
gttaatttta gcttggcact ggccgtcgtt 840 
gttacccaac ttaatcgccc tgcagcaca 899 



<400> 66 

ctttaagctg tagctgtggt ttctgcagca 
ggattatcac acctccatgt atgacaattt 
tggttcagcg gattcctaca caagcaggcc 
ccgggaagca attcgacagg agagagaaca 
gtccaaacct gtagcatttg ccgtgaagac 
ggatgtgcct gttccaagca cagctatctc 
agagaaatat aacaatgatt ggtggatagg 
cttcattcca agtccactca gattggagaa 
acgttttcac ggagggaaat caagtggaaa 
tgggacattc cgagcaactc ccacatcaac 
tacactgtat tccttttaaa aatatttgaa 



attttgtttt tgccttgaaa gaggtgctct 60 
gtacctgcat ggaattgaag actcggaggc 120 
gtctgactcc gatgtctctt tggaagagga 180 
gcaagcagct atccagcttg agagagcaaa 240 
aaatgtgagc tactgcggcg ccctggacga 300 
ctttgatgct aaagactttc tacatattaa 3 60 
aaggctggtg aaagagggct gtgaaattgg 420 
catacggatc cagcaagaac aaaaaagagg 480 
ttcttcttca agtcttggag aaatggtatc 540 
aggtgagggt tgtagttaaa ctctttttca 600 
cacacatgca age 643 
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<210> 67 

<211> 2574 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5517972CB1 



<400> 67 

gcgcctccca 

gaaggggttg 

caggacggaa 

agcctgtgga 

ttaagtggaa 

ttacatcctt 

ttggttaaga 

aatgtcgaag 

tccaatgacc 

cgagtaaaac 

tcgaatatca 

ggcaaatctt 

gatgttctga 

gtacaagatg 

gctcttcggc 

attcaagagt 

ggtgtgtctg 

tccatcttgt 

cttttgctcc 

cctcgatatt 

atgttccacg 

gaggcctata 

gtggcattaa 

gataagccac 

acaaaagctg 

aaggagatca 

ttcaaaaact 

ctgggactgg 

aacagagctg 

gtggaactct 

agagtgtcat 

ccaagtatta 

gccttcttcg 

ctggccatag 

tttgtgttta 

ctgtcatggc 

gaatttttgg 

tatgcaacat 

ggcttgtgga 

tacctgaaat 

tatcctcaca 

ttctgttccc 

tagaaatcac 



cgccggccgc 
tgccgcgcga 
gacgcagtgc 
ggaactgggt 
actgctgctt 
aattgttaaa 
ccgagctcta 
tttttatccc 
tgaaggcatt 
tgaagagtgg 
atgggatcat 
cgttattaga 
taaatggagc 
atgttgtgat 
ttgcaacaac 
taggtctgga 
gaggagaaag 
tcttggatga 
tgaaaaggat 
ccatcttcaa 
ggcctgctca 
ataaccctgc 
acagagaaga 
tcatagaaaa 
aattacatca 
gctacaccac 
tgctgggtaa 
ttataggtgc 
gggttctctt 
ttgtggtaga 
cttatttcct 
tatttacctg 
ttatgatgtt 
cagcaggtca 
tgatgatttt 
ttcagtactt 
gacaaaactt 
gtactggcga 
agaatcacgt 
tgttatttct 
taaaaaagaa 
ttgccatcac 
aacaaactga 



cgccgacgtg 
cggtctgcgt 
tcgccactcc 
aggatttagg 
tagagtttgt 
gcgctgcctc 
ttaagctgaa 
agtgtcacaa 
tactgaagga 
ctttctacct 
gaaacctggt 
tgtcttagct 
accgcgacct 
gggcactctg 
tatgacgaat 
taaagtggca 
aaaaaggact 
gcctacaact 
gtctaagcag 
gttgtttgat 
ggaggccttg 
agacttcttc 
agactttaaa 
attagcggag 
actttccggg 
ctccttctgt 
tccccaggcc 
catttacttt 
cttcctgacg 
gaagaagctc 
tggaaaactg 
tatagtgtac 
tacccttatg 
gagtgtggtt 
ttcaggtctg 
cagcattcca 
ctgcccagga 
agaatatttg 
ggccttggct 
taaaaaatat 
gcactttgat 
actgttgcac 
actaaacatg 



atcgctcggg 
gctgtgccca 
cactgagatt 
aacgcaccgt 
ttggaaggtc 
cgagcgcacg 
aagataaaaa 
ggaaacacca 
gctgtgttaa 
tgtcgaaaac 
ctcaacgcca 
gcaaggaaag 
gccaatttca 
acggtgagag 
catgaaaaaa 
gactccaagg 
agtataggaa 
ggcttagact 
ggacgaacaa 
agcctcacct 
ggatactttg 
ttggacatca 
gccacagaga 
atttatgtca 
ggtgagaaga 
catcaactca 
tctatagctc 
gggctaaaaa 
accaaccagt 
ttcatacatg 
ttatctgatt 
ttcatgttag 
atggtggctt 
tctgtagcaa 
ttggtcaatc 
cgatatggat 
ctcaatgcaa 
gtaaagcagg 
tgtatgattg 
tcttaaattt 
tgaagtattc 
agcagcaatt 
aaagaaccca 



cgcgccgggc 
ctcaaaaggt 
gagagacgcg 
gcacatgctt 
cgggtgactc 
catcctgaga 
ctctccagat 
atggcttccc 
gttttcataa 
cagttgagaa 
tcctgggacc 
atccaagtgg 
aatgtaattc 
aaaacttaca 
acgaacggat 
ttggaactca 
tggagcttat 
caagcacagc 
tcatcttctc 
tattggcctc 
aatcagctgg 
ttaatggaga 
tcatagagcc 
actcctcctt 
agaagaagat 
gatgggtttc 
agatcattgt 
atgattctac 
gtttcagcag 
aatacatcag 
tattacccat 
gattgaagcc 
attcagccag 
cacttctcat 
tcacaaccat 
ttacggcttt 
caggaaacaa 
gcatcgatct 
ttattttcct 
ccccttaatt 
aatcaagttt 
gttttaaaga 
agaaaaaaaa 



gtggttgggg 
tccgggcgcg 
gcaaggaggc 
ggtggtcttg 
atcccaacat 
tcctgagcct 
gtcttccagt 
cgcgacagct 
catctgctat 
agaaatatta 
cacaggtgga 
attatctgga 
aggttacgtg 
gttctcagca 
taacagggtc 
gtttatccgt 
cactgatcct 
aaatgctgtc 
cattcatcag 
aggaagactt 
ttatcactgt 
ttccactgct 
ttccaagcag 
ctacaaagag 
cacagtcttc 
caagcgttca 
cacagtcgta 
tggaatccag 
tgtttcagcc 
cggatactac 
gaggatgtta 
aaaggcagat 
ttccatggca 
gaccatccgt 
tgcatcttgg 
gcagcataat 
tccttgtaac 
ctcaccctgg 
cacaattgcc 
cagtatgatt 
ttttgttgtt 
gatacatttt 
aaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2574 



<210> 68 
<211> 1571 
<212> DMA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 5593114CB1 



<400> 68 

ggcgcggcga cagctagggt tcacggccac 

tttttggtta cgggtccctg atctggaagg 

gatacatcac caactacagc aggcgcttct 

ccggcaagcc tggaagagtt gtgactcttg 



tggggcagag gagccgcgag aagatgtggg 60 
tggatttccc ctatcaggac aagctggtcg 120 
ggcagggcag cacggaccac cgcggggtcc 180 
ttgaagatcc tgcgggatgt gtatggggtg 240 
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ttgcttacag 
aaaaaggagg 
cattcagtgt 
ctctggaaga 
aatatctttt 
atcttttcgc 
gcatataatt 
tacgtttact 
aactggaaat 
aattttaata 
gagttcttta 
atcatcctgt 
aataaatata 
atacacaagc 
ttaattgtga 
aaattggcat 
tttttaaata 
tacaaatcat 
aactgaaaaa 
taaaacagct 
tatacataaa 
caggctggaa 
tttgttttgt 



attgccagta 
ctacagaacc 
attgctatat 
cattgctgaa 
tgaacttgca 
tttggaaaaa 
tagtcttcag 
taaagatctt 
gtcctgaaac 
acataaagat 
gaaaaataca 
ttcacatccc 
aggttatgcc 
ttaaaataac 
atttccttgt 
ttttaaaata 
aaggcatttt 
gatcactaat 
gatgggctga 
tttggaattc 
tatacatata 
agtgtaacct 
t 



ggaaaggaag 
acaacagtca 
attggaacat 
caaattttta 
aattctatta 
ttagtaaagg 
agaattaact 
atttttaatg 
acatatttaa 
ttcctaactt 
gtgaaggact 
aatactattt 
ttcaatatat 
atgcttagaa 
tattcaagta 
cgaaaatttc 
actatatgga 
gatgtagtct 
tactacaaat 
tttgtctgtc 
aaattttttt 
ctggcagaag 



aagaagtaaa 
ttttttatcc 
gtgataatcc 
atgcagctgg 
ggaacctngt 
aacgtttaga 
tcagtgcaca 
tagtgaggat 
aatattggga 
tatgttattg 
cagttcagtc 
tgaaattcta 
tcctatacaa 
atacacaata 
ttaaatgaaa 
cttggaatta 
aaataactca 
gtcattcact 
taatggcaca 
actatctcaa 
tttctttgca 
ccaagaacag 



agcatacctt 
aaaagatccc 
tgattatctt 
tccaagtgga 
gccagaagaa 
agggaaacag 
atgacaatat 
attatttaaa 
tacagtgaaa 
aacacttact 
ttgtttttat 
aacaattaaa 
ttctgtaacc 
atatgaacag 
tcttttgagt 
taatgtactg 
ctaaagcata 
ttgtattaat 
tataatgaaa 
tttgtgtgtg 
gcctgcgtct 
gcacctcctg 



gacttcagag 
acaacaaaac 
ggtcctgcac 
agaaatacag 
gcagatgagc 
aacctcaatt 
gatttggaaa 
cttttatttt 
gaaaaattca 
cactagaagt 
cagagtgata 
ccaaaattcc 
atggtttaaa 
tatttcagcc 
ttttagccaa 
tacctcttct 
aattacatta 
cttataccaa 
atttagtttt 
tgtgtgtgta 
ggccatccca 
gaattataat 



300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1571 



<210> 69 

<211> 1549 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 04477 5CB1 



<400> 69 

cgcgcttagg caggcggtgg cgcggctgga 
tctgcagccg catctcggcc agctctcctc 
ccgggaccga agccgcctgc cgtagcgggc 
gaggacatgc gggagagaga atgagccaga 
gatgtggtgg tacagtggga gctattctga 
tgcagtcatc ttctgtgacg ctttatattt 
ccagtgtcaa ccgagtagtg tctcccggac 
aagaagggcc tcgttccttg tttagaggac 
ccagagcaac atactttgct gcctattcaa 
atcctgattc tacccaagta catatgattt 
ttctcccgag aaaagagtga gatcgtgtca 
tgtagaaata tggactaact taaacctcgt 
tggcttataa agactcggtt acagcttgat 
gcttttgaat gtgttcgtaa agtgtatcag 
atgtctgctt catatgctgg tatatcagag 
ataaaacaaa aactactgga atataagact 
gtgaaagaag catcagattt tgtgggaatg 
gccacaacta tagcatatcc acatgaagtt 
aaatacagat ctttttttca gactctatct 
ctttatcgtg gtctgacaac tcatctagtg 
gccacctatg aattggtggt ttacctactc 
gcaaaaaaag aagaccaaaa gattacagtg 
acagaagaaa aatagtttgg gaacatgtaa 
ttatagtaat cacaccacat tacttggcct 
cagagcctcc aaggaaatgc ctttagaagc 
ccatgtcccc cagacacagt tgggttttgt 

<210> 70 

<211> 2237 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 



gtgccgcggg gagggctgtg ccggttgctt 60 
gccgtccccg gggcgctgtg cgtctccagt 120 
ggccagat cc gcgtcccgcc tcagcggccg 180 
gggacacgct ggtgcatctg tttgccggag 240 
catgtccact ggaagttgta aaaacacgac 300 
ctgaagttca gccgaacacc atggctggag 3 60 
ctcttcattg cctaaaggtg atcttggaaa 420 
taggccccaa tttagtgggg gtagcccctt 480 
actgcaagga aaagttgaat gatgtatttg 540 
cagctgcaac ggcaggcatt tactgtacat 600 
tctcatgctc cccatccgca ggtcacttcc 660 
tttactgcaa tcacagcaac caaccccatt 720 
gcaaggaacc gcggggaaag gcgaatgggt 780 
acagatggac taaaaggatt ttataggggc 840 
actgttatcc attttgttat ttatgaaagt 900 
gcttctacaa tggaaaatga tgaagagtct 960 
atgctagctg ctgccacctc aaaaacttgt 1020 
gtaagaacaa gactacgtga agagggaaca 1080 
ttgcttgttc aagaagaagg ttatgggtct 1140 
agacagattc caaacacagc cattatgatg 1200 
aatggatagc agcacgagga ctgctgtact 12 60 
gaccatggga tacagaagcc agcatggcag 1320 
ctattctaag tggaagtttt gttgtaggaa 1380 
ttcggtaatg tgaaaaaaaa aaaaaaacct 1440 
actcctctct caaaattgcc attttctcta 1500 
tgatttatgg cagtcttct 1549 
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<223> Incyte ID No: 116588CB1 



<400> 70 

gtcactggtg 

tcccctttgc 

acgtggaggt 

ctatttttgc 

gaataccacg 

tttctttggt 

tagctgcttt 

ttggctgatg 

ccagaataca 

tcaacttagc 

ttctaagatt 

gattgggggt 

tcagagagca 

aactgatgta 

ttatgttgct 

attttcttcc 

gtccccggtg 

cgcagtagca 

gattgctatg 

ggtggcattt 

tcagaatctt 

gtatttgttt 

cttactctat 

ggaagaaata 

ggcttaacac 

gaaaaaataa 

ttgtgaggcc 

tagtaggcta 

caggaaagac 

tatatatctt 

ggaaatagaa 

actagtaggg 

cacgaacaat 

tcaatttctt 

ataaactttc 

gataattgcc 

aaaaaaaaaa 

cttggcactg 



ccacggggcc 
gatcgattcc 
cgagggactg 
tatgataatc 
aaattcatgc 
ggctttttga 
gtttgcattg 
gaatttggaa 
tatgctgtga 
atggctagaa 
gaagctttgt 
gtaacgtgta 
gagagaatcc 
aaggacttct 
gtgttccctt 
caggcagcaa 
tttgggctcc 
gccactcttg 
tgtcttctgg 
gtagttcctg 
gggttggcca 
ttggaagtgt 
ttggtgaatc 
aaattttccc 
atcgttggtt 
tggactggaa 
tgttttagcc 
atcaacaatg 
agtgaaaaat 
agggatacag 
gggctgcgtt 
cataatgcta 
agcttgcgct 
atttctgtgt 
tgtacagttt 
tatgtacatg 
aaaaaatgag 
gccgtcg 



ccagcgcact 
agccacacct 
agctgggcga 
ctgctgccct 
tgctgtatgc 
tagaccgagt 
gacaggttgt 
gatttgtatt 
gctggtctaa 
ttggaagtac 
taggttctgc 
ttctttcact 
ttcataaaga 
ccttacccct 
ttattggact 
gtgcaattaa 

tggtggataa 

tgtcccacat 
gactctccta 
aacatcagct 
tcatttccat 
tcttcattgc 
gtgcccaggg 
atactgaatg 
tgaaaacttc 
agttatattt 
tgtgtctttt 
aaagggttag 
ggaaaacgtt 
ccttttcttt 
acacaaaata 
gggaatatgt 
ctactctgta 
tactgaggac 
ttcttatagt 
gataaattaa 
cggccgcaag 



tctgtcttaa 
gtggatgttg 
gttttgtggc 
tcagactcaa 
ctggtattct 
atttggaata 
ttttgccctg 
tgggattggc 
aggcaaagaa 
agtaaacatg 
tggtcacaca 
aatctgtgcc 
acaaggaaaa 
gtggcttata 
tgggaaagtt 
cagtgttgta 
aacagggaag 
gatgctggcc 
ctcattgctt 
gggaactgca 
cattgctggt 
ctgtgtttct 
tgggaaccta 
agaagttaaa 
catttttaaa 
atatccaaat 
gtattgtgtg 
aaaattgctg 
ggagcttctg 
atcttatagc 
aacaatggca 
gaagatgttt 
gttatgtgga 
cctaatcact 
ctaataagta 
aacactgcac 
cttattccct 



gctcctgggc 
ctagttactc 
actcctttgc 
gttaaacgag 
tggcccaatg 
cgatggggca 
ggtggaatat 
ggcgagtccc 
ttaaacctgg 
aacctcatgg 
accctcggga 
ttggctcttg 
acaggtgaag 
tttatcatct 
ttctttacag 
tatgtcatat 
aacatcatct 
tttacgatgt 
gcctgtgcat 
tatggcttca 
atgatactgg 
ttgtcacttt 
aattattctg 
atgaacgtgt 
aatttagagt 
atacctattt 
ttgctaaaga 
tggaacaccc 
ttgagataat 
aggaaaaaaa 
ttgtcatagg 
ttatgaagtc 
ttgccgagca 
tagggatgta 
aaaagtgtcc 
acggagtaaa 
ttagtgaggg 



ctccttattt 
cgcgtccgga 
tcttcagcag 
atatgcaagt 
tagttttgtg 
caatcatttt 
ttaatgcttt 
tagcagttgc 
tgtxtggact 
gatggctgta 
tcacacttat 
cctacttgga 
ttattaaatt 
gtgtctgcta 
agaaatttgg 
cagctcccat 
gggttctttg 
ggaacccttg 
tgtggccaat 
tgcagtccat 
attctcgggg 
tatctgtggt 
caagacaaag 
catgagaatg 
ttagtcatta 
caaagtgtat 
attctacttt 
aggtgaactt 
cttcattagg 
aacttttgag 
ccttcctttt 
tctttctgat 
atgacccttt 
attttatagt 
ttcaaattat 
aaaaaaaaaa 
ttaattttag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2237 



<210> 71 

<211> 1114 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 875369CB1 



<400> 71 

tgggaacgga 

tggagggtag 

ttcccagggg 

cgaggagtcg 

aagacacctc 

ctggggcgga 

ggagaacaag 

cctcgtcctc 

agaggaggaa 

aagcacagca 

gggaggaaga 

agcccatgac 

ctgagacggg 

ccaaagaacg 

agatgcgagc 

ggcggcccag 



gcagccccgg 
gagagcgcgg 
tttggagccg 
cagccggtcc 
gaggaactgc 
ggcctcacct 
atccagctcc 
ctcctcttcc 
gaagaagaag 
ggtggaggct 
ggatggccca 
caaggaggag 
gcgcaccagg 
acacagagag 
tgggttgctt 
cctgggcagg 



gggccccctt 
gaaagcgccc 
gcccgcggcg 
cggggacggg 
tcggcctcca 
tctccctgca 
tcctcctctt 
tccagtgatg 
aagaggaaga 
ctgccgggcc 
gtcctgacgg 
tgggatgccc 
cttattaagg 
atcaacaagc 
ccctgagggc 
tttcagggtg 



gaggcggcga 
cagacgccac 
ccatggctca 
ggtcggaaaa 
catcccaagg 
tcacagagag 
cttccagttc 
gccggaagaa 
agccgaagaa 
cctcgctgga 
atgagcagaa 
ggcagagcat 
gagatggcga 
aagccacccg 
ccccgctggc 
ccagtgggaa 



ggccgcgaag 
tcgcggcgga 
cgtcggctcc 
gagaaagaag 
tcgcaaggcc 
aagcaagcag 
ttctagctcc 
gcgggggaag 
gaagggcaag 
ccagtggcac 
gtcccgaatc 
catccgcaag 
ggtcctagag 
aggggactgc 
caaggcctgt 
gcctgatggg 



ggcgcggggc 60 
cggcggccag 120 
cgcaagcgct 180 
aagagcagga 240 
agcacggccc 300 
aaggcccgga 360 
tcttcttcct 420 
tacaaggaca 480 
gagaaggcgg 540 
cgatcagctg 600 
caggccatga 660 
gtggtggacc 72 0 
gaaatcgtaa 780 
ctggccttcc 840 
ggacgacgct 900 
tgctggtggc 960 



52/60 



WO 00/78953 PCT/USO0/16668 

ctttcccccg tggattggtc tctggcccag cccagtctct tctcaggggc agggggtgga 1020 
ggttggggtc accggcctgc ttggcacccc catctgaaag agcagcactt ctcagctatt 1080 
aaaggccccc tggatagaca aaaaaaaaaa aaaa 1114 

<210> 72 

<211> 998 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1325518CB1 



<400> 72 

gactaaaaaa 

atacattgcg 

aaatgaagct 

tccccttcat 

ttccttatgg 

ccagttaccc 

cttctcctgg 

atgttcctcc 

cagctgcagc 

cagccacacc 

cagaggcacc 

cagaggcacc 

cagccaagcc 

gaaattctct 

ctacaaaagt 

tctcactaca 

cattgataat 



gccatgtatt 
tattttctaa 
tctcctttgg 
tggtgaggat 
catacggaat 
tgggaatact 
attcccctat 
tctccctcct 
acccgctgcc 
tgtagcagct 
tgttggagct 
tgttggagtg 
tgctgcccca 
agaagagtac 
tttctttctt 
ttgatttgtt 
taaataaata 



ctttcgtttc 
aacaataaat 
gcctgcattg 
gacaatgacg 
ttaccacctc 
tacactgaca 
gtctatcaca 
aggggtttcc 
ccacctattg 
gagcctgctg 
gagcctgctg 
gagccagctg 
gaacctcacc 
catgggttca 
ttccaaagac 
tgtggtagtt 
aaatagataa 



tctctaaaag 
ttatagtgtt 
tatgtgttgc 
atggtcaccc 
ctctttatta 
cagggttacc 
tccgtggttt 
cgtttgtccc 
cagctgagcc 
caggggcccc 
cagaggcacc 
cagaggaacc 
cttctccctc 
tttctatact 
tatttcattc 
tttccttgga 
tttagaca 



aagaaaaata 
aatattcata 
ttttgcaagg 
acttcatcca 
tcgcccagtg 
ttcgtatccc 
tcccttagct 
tccttcaagg 
tgctgcagct 
tgttgcagct 
tgttgcagct 
ttcaccagct 
tcttgaacag 
gatgcagaaa 
tgttgtattc 
cttaatttat 



taatttaaaa 60 
gggtcaatca 120 
aagagacggt 180 
tctctgaata 240 
aatacagtcc 300 
tggattctaa 360 
actcagttga 420 
tttttttcag 480 
gcacctctta 540 
gagcctgctg 600 
gagcctgctg 660 
gagcctgcta 720 
gcaaatcagt 780 
taagtgaaat 840 
agagtattca 900 
attgaaaaaa 960 
998 



<210> 73 

<211> 2348 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2060987CB1 



<400> 73 

gggccagccg gctcgcccgg gggccatggc 
cagtcggggg cctcccgggg aagtcatcca 
ctctcgccag actctcacct ggatcccaga 
catctcgacg ctgaaagatg agaccggagc 
cgcccccatc ctcaacttcc tgcgcaccaa 
cagcctcctc catgaagccc agttctatgg 
tcgagaggag ttggatcgat cttcttgtgg 
accagtgttc ccagtgaagc ggcggaaccg 
aggacggcca gcccctgtcc gacggagcaa 
gctgctgggc cgaatgctgg atgagaaaac 
gccggggatg gtgcgcctgg tgtgtggaca 
gtttctagtc tgctacaggt tgaaggaagc 
ccgcctggac tggcccatcg aacgactggc 
gggtgaacat gacaagatgg tggcagcagc 
gcaggcggaa ggcggtggct ccgagatagg 
cttgttcttc gtcgggaacc agctcattgc 
gaatgccgtc accaagcact ggcaggtcca 
ggcaggctcc ttcctcctcc tgggctgcaa 
gaagttcccc ttgcgcatga aagacaacga 
agcggaggat ggggtcaccg ccctcagtgt 
gaactggatc gagatcgcct atggcaccag 
cccggagact gtgggctcgg ggcctcagct 
tgtcaccaag atcatgctgt cggagaagca 
cgtgcggaca tggtctgtga ctcgcttccg 
cccactcgct tcctttaaga tcctggctct 
tgctggcaat gacattggcc cctacggtga 



agcagcggct actgcagccg agggggtccc 60 
tctgaatgtg ggaggcaaga gattcagtac 120 
ctccttcttc tccagtcttc tgagcggacg 180 
aatcttcatc gacagggacc ctacagtctt 240 
agagttggat cccaggggtg tccacggttc 300 
gctcactcct ctggttcgtc gcctgcagct 3 60 
aaacgtcctc ttcaatggtt acctgccgcc 420 
gcacagccta gtggggcctc agcagctagg 480 
cacgatgccc cccaaccttg gcaatgcagg 540 
ccctccctca ccctcaggac aacctgagga 600 
ccataattgg atcgctgtgg cctataccca 660 
ctctggctgg cagctggtgt tttccagccc 720 
gctcacagcc cgggtgcatg gtggggcttt 780 
caccggcagc gagatcctgc tatgggctct 840 
ggtctttcat ctgggggtgc ctgtggaggc 900 
tacaagccac acagggcgca tcggggtgtg 960 
ggaggtgcag cccatcacca gttatgacgc 1020 
caacggctcc atttactacg tggatgtgca 1080 
cctccttgtc agcgagctct atcgggaccc 1140 
ctacctcacc cccaagacca gtgacagtgg 1200 
ctcagggggc gtgcgggtca tcgtgcagca 1260 
cttccagacc ttcactgtgc accgcagccc 1320 
cctcatctca gtctgtgccg acaacaacca 1380 
cggcatgatt tccacccagc ccggctccac 1440 
ggagtcggca gatgggcatg gcggctgcag 1500 
gcgggacgac cagcaagtgt tcatccagaa 1560 



53/60 



WO 00/78953 



PCT/US00/16668 



ggtggtgccc 
ctccgtgcgc 
ctcccggcgg 
cttggccatg 
cctgacggag 
ttcagctccc 
ccactcagcc 
ggctgaggcc 
gagtgggcca 
cactcccctc 
gccatgatgc 
ggaaccccgg 
caccttccct 
aaaaaaaa 



agtgccagcc 
tccgtggacg 
ctcggctctc 
tgggacctaa 
caagagctga 
tcatggggct 
tccagcaaca 
cggcgccgtg 
gacctccgac 
acacctccca 
cttgggatgc 
gttcagggcc 
cttttctgga 



agctcttcgt 
gctcacccac 
ggccccggcg 
ccaccgccat 
tggaacagct 
gtctccccag 
qctccttgtc 
gtgggggcag 
ggccacccac 
agatgaagct 
cctggtcctg 
agggcctcct 
agccaaagtc 



gcgtctctca 
gacggccttc 
ctacctgctc 
ggacggcctc 
ggaacactgt 
cccctcaccc 
tggccaccgt 
ctttgtggaa 
accagccccg 
caatgaaact 
ggggactcag 
tggaataaat 
accctcccca 



tctactgggc 
acagtgctgg 
actggccagg 
ggccaggccc 
gagctggccc 
cgcatctccc 
gggagcccaa 
cgctgccagg 
tggccctcca 
tccttttgaa 
gtgcctccct 
ggttattgtt 
ataaagtcct 



agcgggtgtg 
agtgcgaggg 
ccaacggcag 
ctgcaggtgg 
cgccggctcc 
tcaccagcct 
gccccccgca 
aactggtgcg 
gcggtctcgg 
caacgcagct 
gattcctgtg 
actaggtccc 
cactgccaaa 



<210> 74 

<211> 1139 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 
<223> Incyte ID No: 



2172064CB1 



<400> 74 

ctcgagctgg 

ccacccccgt 

ctgccagtgg 

agccgggctg 

gggtcttcca 

atcacgtgat 

agggacggga 

atgtggctca 

acctgtatgg 

gtataacctg 

gagtcagcgg 

gatggtggag 

gcaccagaag 

ttcaagaagc 

gagatgcctg 

tgtccaagga 

gacaactggg 

ctaataaaag 

agtgtcctcc 



gatgtgtggc 

ggggcgcctc 

gcctggagtc 
caaggctgcc 
ggtcatcttg 
ctggcactgg 
gggcaacaca 
gctgggggct 
gttccagatg 
caatctgtcc 
tttctgtctc 
gacctggaag 
acacaagaaa 
agttccagga 
agttggagat 
ggagcttatt 
actgcgctgc 
ccatgtttgc 
ttacttatcg 



aggttcctgc 
ttgcttcccg 
tatggtgatg 
tgcttcgatg 
gtggctgtac 
gaattatcag 
gatgtcccag 
cggcttgtcc 
cccagctcct 
cgcccctctg 
ttgtttactt 
cacaaatctt 
gcaaccgata 
agaagcttag 
gaactttggc 
caccattttc 
cttgactcca 
accattgctc 
caagtaataa 



ggcggctgct 
tgctcctggg 
agcagagtga 
ccttccaccc 
ccagcgccct 
gaaaggggaa 
gggctggaag 
tggagggggc 
ttgcatgtcg 
agaagaccat 
ttttggagct 
cctcttctaa 
gcctcccagt 
cccaggaaaa 
caactttcct 
tatacatgtg 
cctctacata 
agggaggcat 
aaatcccctg 



ggcggaggag 
attccgcctt 
attcgtgtgt 
cctctccccg 
ctatatgggt 
ggaggaggag 
cctcaggctg 
agccctgggg 
ccgagaacct 
tttcctaaag 
tgtgcttctg 
atacttccta 
ggtggaaacc 
acaaagacca 
catcaccata 
acatatgtag 
caatgactca 
tgctttgggg 
ggaaatcctc 



agccggcgct 
gtgctgctgg 
cacacccagc 
ctgcgttcct 
ttcactctgt 
accctgatcc 
ctctgggctt 
ttgcagtacc 
tgccttggta 
accatgtttg 
ggtttgggga 
acttcagaga 
aaagagcaat 
gttggaccca 
cttaaaatcc 
cagcataacc 
gctaaccaga 
aattattccc 
aaaaaaaaa 



<210> 75 
<211> 863 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 
<223> Incyte ID No: 



2219267CB1 



1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2348 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1139 



<400> 75 

tcagagctag 

tcctggaggc 

gtggacgccg 

tcctcgctgg 

tcctggcctc 

cggtgatcag 

ccggccccct 

cctcccttgc 

ggctgcccca 

gcccccgcca 

cccccgagca 

ggcctggggc 

aagcccgtgt 



cttccgccgc 
cggcctggag 
ccccgtgctg 
ggcccagtat 
ccgggctgtg 

gggggccggg 
ggacaccctg 
tgtccttgcc 
gtcactgcag 
ggaccacctg 
gcactagtcc 
atggctgagt 
ctcagtgcag 



agcctggcac 
gcggcagcct 
ctgctgggca 
ctgccaggct 
tccgcactca 
ctgggcctgg 
cacggccggc 
ctgctgtgtg 
gacgccgacc 
cctctgctgc 
tgcctggtgg 
accccagacg 
gtggagccgt 



ctcaggtgcc 
tggtcttcct 
ccatggtcac 
ggactgtgct 
gcagcctctt 
tgctgggggc 
agggcttctt 
tcctgctgct 
gcctgcgccg 
cgccctccaa 
ccctgggagc 
tctggtccag 
ggggacagcg 



gaccttctac 
gctcctgacg 
aggcctggca 
gttcctctct 
cgcggccgag 
cgggttcctg 
cctgcaacaa 
gcctgagagc 
ctccccactc 
ctcctactgg 
caggatggga 
ggcagacaca 
tgaaggtgtc 



ctgccctact 60 
gcagattgct 120 
tccctgctgc 180 
gtcctggggc 240 
gtcttcccca 300 
ggccaggcag 3 60 
gtcgtcttcg 420 
cgaagccggg 4 80 
ctgcggggcc 540 
gccggccaca 600 
ccaaagtcaa 660 
ttcctctcag 720 
tccagccagg 780 



54/60 



WO 00/78953 



PCTAJS00/16668 



ccccaggcac tgggaggccc tgggtctccc cccagccaca cccagtaggt gtggaggata 840 
aaggcttctg tggaaaaaaa aaa 863 

<210> 76 
<211> 1322 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 
<223> Incyte ID No: 



2308629CB1 



<400> 76 

ccggggggcc 

actccggccc 

caggcgccgt 

cccggatgca 

tctggaggat 

caacaggcgc 

cattgagtga 

gtgtggcaac 

tgcagatgta 

atgaaggggc 

tccaagccat 

ggtacaaccc 

ccacaacccc 

actcacacat 

taggtggggt 

ccacagccat 

aagagtggag 

gctgcatcct 

ggaaggtaga 

acctctgttg 

gcacctttga 

ggataagcag 

aa 



ggcgcgcggg 
ggactacgag 
ggcagggatc 
gagtctacag 
tataagaacg 
agggcctgcc 
tgtaatccac 
attacttcat 
caactcacca 
cggggccttt 
tcacttcatg 
aagctcccac 
actggacgtt 
tacaggacat 
gaccgcctat 
cgcatggtct 
ggctggcaag 
ggtcacattc 
ggggctctcc 
ccaccacctt 
taacctctct 
ctcattcccc 



gaggccgggg 
gcgctgccgg 
ctggagcact 
cctgacccag 
gagggcctat 
cacgcccttt 
cctgggggca 
gatgcagcca 
taccaccggg 
taccgcagct 
acctatgaat 
gtcctctctg 
tgcaaaacac 
atcacaggca 
ttccgagggg 
gtgtatgagt 
tgaagtagca 
tctgtctcct 
cccaggattt 
tccttccagg 
ccatcctggg 
tggttcctaa 



cctgcaggcc 
ctggagccac 
gcgtgatgta 
ctgcccgcta 
ggaggcccat 
attttgcctg 
atagccatat 
tgaaccctgc 
tgacagactg 
acaccaccca 
tcctgcagga 
gagcttgcgc 
tgcccaacac 
tggctagtgc 
tgcaggccag 
tcttcaaata 
ctgaacgaag 
ggaatgctcc 
tggtgttttg 
ccctaagcac 
cctgatgacc 
taaaaagcct 



cccggtacga 
tgtcaccacg 
ccccatcgac 
tcgcaatgtg 
gagggggctg 
ctacgaaaag 
tgccaatggt 
ggaagtggtc 
tgtacgggca 
gctgaccatg 
gcactttaac 
aggagctgta 
ccaggagtcc 
cttcaggacg 
agtaatttac 
cctaatcact 
ccaggggttc 
cacctcaagt 
actaacacca 
gtgcagcaaa 
tgctctagac 
ttaaattaaa 



caagatccgg 
cacatggtgg 
tgcgtcaaga 
ttggaggccc 
aacgtcacag 
ttaaaaaaga 
gcggccgggt 
aagcagagga 
gtgtggcaaa 
aacgttcctt 
ccccagagac 
gctgccgcag 
ttggctttga 
gtatatcaag 
cagatcccct 
aaaaggcaag 
agatgacact 
ggagttagaa 
gttcctgcca 
gcacaccaca 
tgttatagag 
aaaaaaaaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1322 



<210> 77 

<211> 1869 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 
<223> Incyte ID No: 



2660038CB1 



<400> 77 

cagggacgct 

ggctcgagga 

cttcttcctg 

gctcccgtcg 

ttgacctcag 

ccctgcttgc 

ccctgacacc 

aggccttccc 

gggtcagttc 

gcggggcctg 

gagaactgca 

tcaccatcgg 

tgtaccagtt 

gctacctgca 

tggctggggt 

aggcacaggc 

aggcgctaac 

gcggcctgcc 

aggacctcct 

ctgccatgat 

caaggctcta 



cattcctggc 
aatcaccggc 
cctccctggc 
cccgccatcc 
agggtcctgc 
cagctcttga 
cttgactgca 
cctgtcttcc 
ctgctacagc 
tgtattcacc 
ggcccctggc 
caaggtggat 
cctgatgaat 
cacagccgaa 
catgggagcc 
agcctcagaa 
cgagattggc 
ccgagttatc 
gagccagtgg 
gagtggcatt 
caaccagccc 



cagagtcctg 
tcttctcccg 
ttccatctcc 
tgagccatcc 
cttaagcttc 
ctcccaaatt 
aatctaaatt 
aaccccgctt 
tatcccacca 
aatcccctgg 
acataccagc 
ggccttgctg 
ggcatccgac 
gcgacccaca 
tacttgggga 
attgctgtag 
cagaaacatg 
gtcggttcct 
gagatctttc 
gcagtcgtct 
acagatgcac 



acttctttct 
gatccttttc 
caaccccgcg 
cacctgcaac 
tcaccagaat 
ccagctgacg 
cttatcttct 
tctgacaccc 
tggacttctt 
aggtggtgaa 
ggcactaccg 
ccctgcagaa 
tgggcaccta 
gtcctgcccg 
gccccatcta 
ggcaccagta 
gtctggtggg 
ccacccagct 
ctccccagag 
tggccatggc 
agggcaaggg 



cggatcgaga 
ttctctcact 
attccctcct 
cttctgtctt 
ctcctagatt 
tttgaccact 
gcaacctgta 
atttttactt 
gatgagtggc 
gaccaggatg 
aaatgtcttc 
aggcctggcc 
tgggctggct 
cagcgcagca 
catggtgaag 
taagcatcag 
gttatggcgt 
gtgcaccttc 
ctggaagttg 
accctttgat 
cctcatgtac 



tctcgttgct 
tgctggcttt 
ctactcccgt 
ttgcccctcc 
tctatctctt 
tgacatttga 
ctgctgaaac 
tcttactctt 
ctggcagcct 
cagttgcaag 
catgccttca 
cccgccctct 
gaggctgggg 
gctggggcca 
acacacctgc 
ggcatgtttc 
ggggctctgg 
tcatccacca 
gcgctggtgg 
gtggcctgca 
cgggggatac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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tggacgctct 
gtgcctccta 
tgcgctccct 
agcactcctt 
ctttcatcca 
actacaagag 
tccctgggta 
tcccactcac 
ccctccaaaa 
ctgttagatt 
aaaaaaaaa 



gctgcagaca 
cttccgcctc 
ctactacaca 
ggccacttgt 
tccatggggg 
atgagtttcc 
actcacaccc 
cactgccacc 
ccaaagggaa 
caaaggcaca 



gctcggaccg 
ggcccccaca 
gacactaaat 
gcctccacca 
acagccaacc 
cttctttcct 
ctctctcagg 
cttgagtccc 
tcgagagacc 
gagattatat 



agggcatttt 
ccatcctctc 
aacagccgct 
ctatgtcctg 
ccactcccca 
tgggtgttgc 
gctgaacgag 
tcctgctccc 
caattcccag 
tgattataaa 



tggcatgtac 
cctcttcttc 
ttcccagtct 
gtgactactg 
tctgttctca 
tttaaacctt 
tcatcccaaa 
atgcacagtt 
gcgtctggga 
gcaagtttat 



aagggtatag 
tgggaccagc 
ccaccaaatg 
attaggtgac 
gggttgaatc 
ccctacccat 
gtgtatttcc 
ttaaactcct 
cccaggtgtc 
tctgaaaaaa 



<210> 78 

<211> 1881 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2670745CB1 



<400> 78 

gaagaaccga 

cgggttctag 

atagcagagg 

tgttacggtg 

ctgtcgggca 

tgcgtcagat 

cggtgaagtc 

caaccgtcct 

cctgcaatgt 

cagggacagt 

acatttttgg 

ataccagcac 

acttccactc 

gctttgggcc 

ccagaactga 

gccactcggc 

tgaaccggca 

ttgaaccgaa 

acaagattgt 

ttgaccttat 

tgtgcaaact 

ggtgggagct 

atgggtagga 

ctgcccatct 

gttgttgggc 

gggctaggtt 

cacctccttt 

gaagggaagg 

cctccccctt 

aaggagttgc 

gcagataaat 

ttttataaaa 



gcttggctgt 
gctgcaggca 
cagcaggccg 
gacagtgcac 
tcgggcatac 
agatgtgcac 
tgccatccgt 
catcgacgac 
gctctatgcc 
tcctggggcc 
gggctacgag 
catgacatgg 
agccacaatg 
attccattcc 
ggcttggctg 
ctttggctac 
cttccatgac 
ggggaagggg 
cctctttggg 
agatcattct 
ggccgtgatt 
gaatgccatg 
ggaagtttct 
gtcacccacc 
gagaggtgtt 
cctcccccct 
cagctgctcc 
gaatggggag 
tgcctatccc 
agctgttggc 
cccacccttc 
attaattaaa 



gtttatctcg 
gctcgaggac 
tgccgggggg 
ctggagggcg 
tccttcgggg 
attttcaatg 
gggcaagctc 
acagtcctcc 
tttgacgtca 
cgggatggac 
cagcaggcgg 
actcttatct 
ctgggaagtc 
aacaatgaga 
gactgtcccc 
aatggggagc 
ctctggaagt 
ccatgtcccc 
ggtaccagtc 
gacttacaca 
cagtataacc 
accaccaaca 
gccacctccc 
tgctcctttg 
ctctgtgctg 
tgggccgagg 
tgggcctcag 
aagggagaag 
ctcccctctg 
atgagacctc 
cttgagctgt 



ttggggacta 
ccgcggcccc 
gcatgttgct 
ggccccgcag 
gttactgctc 
cagtgtcctt 
ctgtggtacc 
tttggggcgg 
atacgcacaa 
attcagcctg 
actgtttttc 
gtacaaaggg 
acatgtatgt 
tttactgcaa 
cgactccagt 
tgtacatctt 
ttaatcctgt 
gccggcgcca 
catctcctga 
ttttggactt 
tagaccagtc 
gcaatatcag 
ctcctgagcc 
acccctggac 
tgaattcagt 
gccccttccc 
ctctgcccag 
caagcagtgt 
cttgagcctt 
cttctccccg 
cgctgtactc 



aggcgtcggt 
gccccggctc 
gtaaccagtg 
ggtgaaccat 
tggtgaagac 
gcgttggaca 
ctacatgcgc 
gcggaatgac 
gtggttcaca 
tgtcctaggc 
caatgacatt 
cagccctgca 
ctttgggggc 
ccgcattcga 
gctgcctgag 
tggtggttat 
gtcctttacc 
gtgctgctgt 
ggaaggcctg 
tagccctagt 
ctgtttgcct 
tcgccccatc 
tgctgtcatc 
ttggtatacc 
ggggagctgt 

cttggtgctc 
ggccagccag 
ctgagcctca 
gagccttgac 
tcttggggag 
tgaagttcag 



tggcgcgcaa 
ggcctggcag 
gcccagggga 
gctgcagtgg 
tatgagacac 
aagctgcccc 
tatggacact 
accgaagggg 
ccccgagtgt 
aagatcatgt 
cacaagctag 
cgctggaggg 
cgtgccgacc 
gtctttgaca 
gggcgccgga 
aatgcaaggc 
tggaaaaaga 
attgttggtg 
ggagatgaat 
ctgaagactc 
catgatatca 
gtctcctccc 
ttcactgccc 
tccatgtgga 
agcggggtgg 
tgtccccatc 
gttctgctgg 
ggagcttccc 
tgggagctga 
gtggggacca 
ccagctcaga 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1869 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1881 



<210> 79 

<211> 2004 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> mis cofeature 

<223> Incyte ID No: 2676443CB1 

<400> 79 

aatcgcaggg ggcgtggcct agcggtcccg cccccggggc gcgcgcgcgc attggctgtg 60 
cggggtgcgg gcgcgcgggc ggcgctttga accgggcgcg gggcgcgggg cgcggggcgc 120 
tgcggccggt acacgccggg gtagggccgg ggtcgggttg tggtcgggcc gggattgggc 180 
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tctcctgggc catggcagcc gaggcgcgcg tgtcgcgctg gtacttcggg gggctggcct 240 

cctgcggggc cgcctgctgc acgcacccgc tggacctgct caaggtgcat ctgcagacgc 300 

agcaggaggt gaagctgcgc atgacgggca tggcgctgcg ggtggtgcgt accgacggca 3 60 

tcctggcact ctacagcggc ctgagcgcct cgctgtgcag acagatgacc tactccctga 420 

ctcggttcgc catctacgag actgtgcggg accgcgtggc caagggcagc caggggcctc 480 

tccccttcca cgagaaggtg ttgctgggct ccgtcagcgg tttagctgga ggcttcgtgg 540 

ggacgcccgc agacttggtc aacgtcagga tgcagaacga cgtgaagctg ccccagggtc 600 

agcggcgcaa ctacgcccat gcgctggatg gcctgtaccg cgtagctcgt gaagagggtc 660 

tcaggagact gttctcgggt gcaaccatgg catccagccg aggggcctta gtcactgtgg 720 

gccagctgtc ctgctacgac caggccaagc agctggtcct tagcaccggg tacctctctg 780 

acaacatctt cactcacttt gtcgccagct ttattgcagg tggatgtgcc acgttcctgt 840 

gccagcccct ggatgtgctg aagactcgcc tgatgaactc caagggggag tatcagggcg 900 

ttttccactg cgccgtggag acagcgaagc tcgggcctct ggccttttac aagggcctcg 960 

tcccagctgg catccgcctc atcccccaca ccgtgctcac ttttgtgttt ctggaacagc 1020 

tacgcaaaaa ctttggcatc aaagtgccat cctgaccagc cgtgggaatg gctgggctgc 1080 

caggccagac acgctaggtt cttccaaaga gtcccaagcc cagcacctgc tcctggggcc 1140 

acgacctccc tggccgtggc cacccatcct ccgcagcagg cccctgctgt ccccccacct 1200 

gctggctgag ctcctcctgg cctcgtcccc tctcagctgt agctgcacca cccccgctct 1260 

ggctaccagg ctctcccggc tgggcactgc gtggccttgc ccctctcccg ctggcagctc 1320 

ctcaggggaa caggggctac cagaggctga tttctcccct ctcctgggcc aggggagggg 1380 

tattatccct gcctcctgcc cccgatgccc aaagcagcat cttccagcac tttccatcga 1440 

ggacttgggt ggcagagtgt gggtgcagcc tggctgttgc tcacccaagt gctagctctg 1500 

cacttcgtgt ctgctgagag caaccagacc ttccatgtcc tcgggcagct gcaactcccc 1560 

gcgagacccc gcagctgggt gggatgaaca agcaacgcag accacaagcg agtgcctggg 1620 

agggagtggc ccagggtggt tctggagcca ttgtgggtga gggtcgaggg ccaccgaggt 1680 

cccgcgcacc gctgcctgcc ctgcagtggc tttaacagtt agttttgcca aagcctctcc 1740 

actcaccagc aggcggtctc tgtcttcagg gattgtgcct gcgtccctcg ggcacctggg 1800 

cccccccgct tggctccctg ggggaatggc ccaggcgggc cgcggttcct ccttagggcc 1860 

ttctccccga caaggagtcc gacggggcgg atgctgcatc ctctgcctcc ctggtcgctg 1920 

ggcttcaccc cacctgggaa gggcagtgtg ctctgtgggg gctgcaatca ataaatgccg 1980 
ggagctgcca aaaaaaaaaa aaaa 2004 



<210> 80 

<211> 3555 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> mis cofeature 

<223> Incyte ID No: 3295764CB1 



<400> 80 

gaccgtgggc gagtccagaa 
aagtgcctga ccctcactcc 
cctcccctcg cgccccggat 
gcggccactg tgctcctgcg 
gcgctgctct gcgcctacgg 
ccgtacctgc tggggccgga 
ccagtatgga cttactctta 
ctccgttata aacctgttgt 
ctgctctatg cccagggact 
acagccactg aaattgccta 
cagaaagtca caagttactg 
ctagggcaaa tccttgtctc 
cttacctgtg tttcagtggc 
ctcttctttc accacattcc 
ggtggcattg ttactgacac 
tcaaaaatcc ctctaaatat 
cgtctccttg tattgaaagt 
cttctctgct ggtctgtgtg 
tacacacagg gcctgtggga 
ggcgtggagg ccgtttcaac 
aaaatatcct ggtcaacttg 
gctgcagtgt atatcatgga 
gtcttcagaa tcatctacat 
ctcagcatgg aacgctatgc 
cagacgctgc tcactctaat 
cagtttttga tctatgccag 



cgtcctggcc ttacagggag 

agttggcgga ggaggagaag 

ggatgtgccc ggcccggtgt 

gaccgctcgg gtccgtcgcg 

cttcttcgcc agcctcaggc 

caagaacctg accgagaggg 

cctggtgcta ctgttccctg 

tctactgcag gggctcagcc 

gctggccatt caatttctag 

ttactcttat atctacagtg 

tcgaagtgcc actttggtgg 

agtggcaggc tggtcgctgt 

ttttgctgtg gcctggtttt 

ttctacctgc cagagagtga 

cccagcttct aaccaccttc 

ggaggagcct cccgtggagg 

actatggaat gatttcctga 

gtgggccctc tctacctgtg 

gaaagtgatg ccttctcgct 

cttactgggt gctgttgctg 

gggagaaatg acattatctc 

cactgtgggt aacatttggg 

gttactcatc acgatagcaa 

cctagtattt ggtgtaaana 

tgtggtagat gccagtggcc 

ttattttgca ctcatcgctg 



aaggcgtcac tcgcggttac 60 
gaaggggccg ggccgggtcc 120 
ctcggcgggc ggcggcggcg 180 
aatgctggtt cttgccgacc 240 
cgtccgagcc cttcctgacc 3 00 
aggtcttcaa tgaaatttat 3 60 
tgttccttgc cacagactac 420 
ttattgttac atggtttatg 480 
aattttttta tggcatcgcc 540 
tggtggacct gggcatgtac 600 
gctttacagt gggctctgtc 660 
tcagcctgaa tgtcatctct 720 
tacctatgcc acagaagagc 780 
atggcatcaa ggtacaaaat 840 
ctggctggga ggacattgag 900 
aaccggaacc caagccagac 960 
tgtgctactc ctctcgccct 1020 
gctattttca agttgtgaac 1080 
atgctgctat ctataatggt 1140 
tgtttgcagt tggttatata 1200 
tcttttctct cctgattgct 1260 
tgtgctatgc atcctatgtt 1320 
cttttcaaat tgctgcaaac 1380 
ccttcattgc cctggcactg 1440 
ttggattaga aattaccact 1500 
tggttttcct ggccagtggt 1560 
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gcagtcagtg ttatgaagaa atgtagaaag 

accacttcat aatatactgc tgaagggctt 

tgcctggatg tatttgatct tttaaagcgt 

acttcacagc agccacttga ctaatacctt 

aggagccaga agtaaagttt atttcatgga 

ttttttctga ttcaaaagtg aacttgattt 

acatggtgtt gtatacttca ttagcaagtg 

gtatctactt agccagatgc ttggcctggt 

gtngcaagtc atgatggatg gaaatatgtt 

tcttatttct gtgttactca tgtacattgg 

atatttgttc tggtttagtg ttctgtattt 

ttattttaaa aaaattcatc atatgaatgt 

tctgtaaaat ttgtttagaa gtacgttttt 

atttagttag aaaataaggc aagttttgaa 

taaggtacgc ttgcttgtct ttaaatcttc 

aagttcatat ataggctaaa gatgtaacta 

ccaaaataaa atgaaaaatc tttttttaat 

aatttgtttt tcttatccaa gtaaagatgt 

ttcataaact gagcctcttt tcaattattt 

tcctgcattt ttttacctta aatctgcctt 

tgcttaaaat tgctagtgat ttcattacca 

taattctgaa atcagtctac caaggggctg 

aaagctgact ttgttgcctt atcttgaaaa 

ttaacttttt tagaatgaca cagctgaaca 

gtcgacatac tcatttggtt taagcaggaa 

aggtcatttt ctattcacca tcctttacta 

gaagggaaag ggaagtgata tcataaaagt 

ttcttttttt gctttgtttt gtttgtgcta 

gaaaagtgct gtttgaggga ttttattaaa 

ttcatgttac caaagcaata tttccctgga 

gagccttttc ttatataaga caagaatatg 

gaatgctgaa ctcaatgtgc aagttgtact 

tttgttgtac tttttattca attctgtata 
ttttacagta tattt 

<210> 81 
<211> 1293 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_f eature 

<223> Incyte ID No: 3438320CB1 



ctggaagatc cacaatcaag ttctcaagta 1620 
cttcttatag caagaactct gcacagcaac 1680 
agacatatat ttatgaatgt gcatttcttg 1740 
gtgttccggg aataacatga tactattcag 1800 
ttatttatga gagccaattt aaggatgact 1860 
taaaaaccag tcaagagcaa tcaaagcagc 1920 
agtttggtgt tttataggtc acatatgtct 1980 
gggaccaggg ctccacagag gccacaaaat 2040 
ctaacagcat ctgcctctat tcaatttaat 2100 
tctttctaca tagttattct atcactggca 2160 
taaggtgtac gtatcatttc taattttaag 2220 
tcttggttcc cattgtgacg attatttatt 2280 
gcattattca tatgcttccc agagaagctc 2340 
gcctgctaaa tgaagagact taagaaagct 2400 
aatatgaagg actattaatt ccaagattaa 2460 
ggccatttgt atttgtattc ccttttattt 2520 
aatttcatcc ctatttatag tttttatatt 2580 
caataggaat tgcattagtc caaggccttt 2 640 
caatgggaca ggaactagga tagatgtgat 2700 
tgtttctaaa ggtagatcat cttgaatatt 2760 
agttacttga aaaaatgttc tatatgcatt 2820 
ctagtatatg tcagacatga aaactatttt 2880 
gaatctagat aggtgctttt aactggggta 2940 
gtgttaataa tagtgtgtca agattgcaaa 3000 
tcctagaagc aaatggatgg ggataagaat 3060 
ttaagggaaa ggaaaagaac actagctaag 3120 
agcaaccttc attttacatt ctgtctgttt 3180 
atttgggaat tgtgtactcc gaaacaagta 3240 
tcttttttta atggaatgtg gtacaaattg 3300 
atttaattca aagtttgtgg catacaacct 3360 
ttcacatctt ggtatgtggc catatttata 3420 
gtatgcagtt ttgtaaataa gtgaaaataa 3480 
gattataaaa ttatttttat taaataaata 3540 

3555 



<400> 81 

atatgcccca cgcgggactc atactacgtt 
ccctgatatt tatctcagtg gacggtggcc 
ataaacgctc tcccctcggc tcccggacgc 
ctgaggagca aaggagtaac caagagatcc 
ccggggcctg gtggctgggc cagacttgga 
ggaggtggcc caacataaca ggcccgaaga 
cgacctaacg tcattggcac aggaatacaa 
agttgcaggc caggatatca gccactggtt 
catagatccg ctgaccggct gcctgaggta 
tccgcctcag ctgccctgtt cggactgggc 
gtcgtattat gaggtggggc ggctgtctgc 
gctcacgtcg caggagcaca cactggaggt 
acaccgctat ctcccctata actcacatgc 
gaacctgaac atggatttta ccctggaaga 
tgactatctc agtatggacg gtacacttca 
tgatctcacg gagttgtagg caaggagatg 
agtttggctt tttctgtgcc ttgaggaaaa 
atctccactc ctctccagga gctagcctgt 
tgccttcatt acagtttgct ctgagaaaat 
aagatcaagt accttggttt agggagatgt 
tgtttgatag ttaagagaga gtagttctac 
aatgatggaa atgagatgtc tgcaggagat 



tcccgtgaac acgtgcagtc caaaccccgc 60 
ggaaaaggac aatggtttcc atgtcagcgg 120 
gacggaggtc gtagtagtag tgagtacgtg 180 
agtgaccgac agagcaagag ccatgccgcg 240 
gtattttcag cgtcgctatt tcacgccggc 300 
cctctgggta tcttacctgg gacgcgtgta 3 60 
ggggaacctg ctgctgaaac ccatcgtgga 420 
tgatccaaag accagagaca tccgcaagca 480 
ctgcaccccg cggggccgct ttgtgcacgt 540 
caacgatttt gggaagccct ggtggcaggg 600 
caagacccgg agcatccgca tcattaacac 660 
gggggttctg gagtccatat gggaaatcct 720 
tgccagctac acgtggaaat atgaagggaa 780 
gaatgggatc cgggatgagg aggaagaatt 840 
cacacctgca atacttctgt acttcaatga 900 
tacactcgtg tagactcaag acgtatttcg 960 
gtggtggggc cgaggggtgc ctggacccag 1020 
gcccttctga agtgtaaagg ccctattccc 1080 
tagtgaatta atctttggga atgatacaag 1140 
agaagaggat agtcagagtt caggcagaac 1200 
aggggtgagg gatggaagga cttttttggc 1260 
ggg 1293 
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<210> 82 
<211> 1489 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 3986488CB1 



<400> 82 

cggggctggc 

aaggcgctgc 

gcagccacca 

tttgataaga 

ataagtggat 

ttcaccattc 

aacatcatcc 

cttgctttta 

aatggagttt 

ttctacaatg 

ctgcaacagg 

ctttcctgtt 

tcagccctga 

atattaatcg 

atggcagggg 

gtgggtgaag 

gactgacttg 

tcggattcgt 

tacctcatat 

cctgtggaaa 

tgctcttcta 

gttgatgtaa 

ccaaagcaaa 

gttaactgcc 

aatcatcaaa 



gggcggcgct 
tgaccaccta 
taatgatact 
aaattcctgt 
tatcaagcac 
cacttacctt 
tcagtgtctt 
acttagaagg 
ataccaaaca 
cctgcttcat 
ctactgaatt 
ttttggggtt 
cgacagcagt 
gtggagacta 
gcttgagata 
aaaacatctg 
tgactgcggg 
gacatccacc 
gcaggatgag 
gcaaatcttt 
gggacctatg 
tggtgactct 
gggccagttg 
agtaaagtta 
gtaaataaaa 



cttctacggg 
cggtttcccg 
atatgtgtcc 
aaagctgttt 
aagtaaatta 
acttctggaa 
tgccattatt 
ctatattttt 
gaaaatggac 
gattatccca 
caaccaatgg 
tctgctgatg 
ggttggagcc 
cattttctct 
ttccttttta 
tttggatttg 
ctgggggggc 
ccctgggcaa 
ccaatggcag 
tcaaaataag 
gggccttcgt 
tttctgatca 
tttcagttta 
tttgttagct 
aatcatttct 



acctgctcct 
tcaccaattt 
aagctaaaca 
cctctgcctc 
agcctaccga 
accatcatac 
ctcggggctt 
gtattcctga 
ccaaaggagc 
actcttatta 
aagaatgttg 
tactccacgg 
atcaagaatg 
ttgttaaact 
acactgagca 
aagagctaaa 
attcccagta 
gtgagagcat 
tctcaagaaa 
ccactgggac 
ggcatctctg 
gcaccttggc 
aacagacatg 
ttgatgaaag 
atgtaaaaaa 



tcctcatcgt 
tccttggaat 
aaatcattca 
tcctctacgt 
tgttcaccgt 
ttgggaagca 
tcatagcagc 
atgatatctt 
tagggaaata 
ttagtgtctc 
tgtttatcct 
ttctgtgcag 
tatccgttgc 
ttgtagggtt 
gccagttaaa 
gagtctgcag 
ggaatgtgaa 
ctgcaaaatg 
tgtactcggg 
tcggtaggtg 
tgctgtgtgc 
cgtgattccc 
tccttagtct 
ctatgttggt 
aaaaaaaaa 



gcttgtcaac 
tggacagatg 
cttccctgat 
tggaaaccac 
gctcaggaaa 
gtattcactc 
tgggtctgac 
cacagcagca 
cggagtactt 
cactggagac 
acagtttctt 
ctattacaat 
ctacattggg 
aaatatttgc 
acctaaacct 
caggattgga 
gccagaggtt 
caaagagaac 
cgacacctta 
gagccccagc 
tggggaggag 
aaggtcccag 
aataaaatta 
atctttccct 



<210> 83 
<211> 927 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> misc__f eature 

<223> Incyte ID No: 4378816CB1 



60 

120 

180 

240 

300 

360- 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1489 



<400> 83 

ctccctgctt tcctctgccg catggtcctg 
gcgctgagtg ggagctgggg gccgaggctg 
tggcggcggg ctgcgccctg ggcctgcgcc 
gggcgctcat ctggctctgc tacgacgcgc 
tctacttgtc tttagtagga aacgttgcaa 
aagaatatgg caaagctgat gcaagatggg 
aaattctgac cgtcgccctg gatgggtctc 
aagaaaaata ttaccggcat ttcctgcaga 
gctggatgac cttcctccca gagtggctca 
ggctgtactg ttggctttac ctgttttttt 
tgctactgtg gcagtcatgg ctagaactca 
tgaagaagtt tcagtgaact ttcaaaacca 
aatgaatcaa atctttttgt ttggccaaaa 
tgtattgttg ctcctgaaca acctgtttca 
tattgttgtt caattaaatg gtgatatagg 
taaaatgttt aattatacaa aaaaaaa 

<210> 84 
<211> 970 
<212> DNA 

<213> Homo sapiens 



ggccgttggc gtcggaagcc tgaagcatgg 60 
gcggttcgct gctgctgtgc gccgcgctgc 120 
tgggccgcgg gcagggggcg gcggaccgcg 180 
tggtgcactt cgcgctggaa ggcccttttg 240 
attccgatgg cttgattgct tctttatgga 300 
tttattttga tccaaccatt gtgtctgtgg 360 
tggcattgtt cctcatttat gccatagtca 420 
tcaccctgtg cgtgtgcgag ctgtatggct 480 
ccagaagccc caacctcaac accagcaact 540 
ttaacggtgt gtgggttctg atcccaggac 600 
agaaaatgca tcagaaagaa accagttcag 660 
taaacaccat tatctaactt catgaaccag 720 
tgtaatacat tccagtctac actttgtttt 780 
aattggtttt aaggcgacca gttttcgttg 840 
gaaaagagaa caaatttgaa tttgtaataa 900 

927 
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WO 00/78953 



PCT/US00/16668 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 4797137CB1 



<400> 84 

ggatgcagca 

tcctgttctt 

tccgaggtta 

tcactgacaa 

caatatttca 

tttcctgggt 

taaaggacaa 

ttcccatgac 

cccttctttc 

gaatggggag 

ctattgaggt 

gtgtccgttg 

tatgacacaa 

ctgtcagaga 

taaagacttt 

tcaggaagtc 

tgaagaaatc 



gagaggagca 
ccagggtgtt 
tgttggagaa 
gcttactata 
ttatcagtct 
tggaaatgta 
tgggacattc 
agagctaaca 
catccttgtc 
gaaggctgct 
ttccgatgac 
cgctgagtgc 
gaagagtcac 
aagtggagct 
aggaaccact 
tcataagaga 



gctggaagcc 
tatatcgtct 
aagatcaagt 
gactggacat 
ttccagtacc 
tacaaagggg 
agctgtgctg 
gtcacagaaa 
tttgtgccct 
gggctgaaga 
actgatcagg 
ctggattcag 
ctaaagacag 
ggcctggacc 
tatttattga 
ctcatgactt 



gtggctgcgc 
tttccttgga 
tgaaatgcac 
atcgccctcc 
caaccacagc 
atgcatctat 
tgaagaatcc 

ggggttttgg 

cagccgtggt 
agaggagcag 
aggaggaaga 
actatgaaga 
gaaacatccc 
atagcgatgg 
ataaatgttc 
cccctttcaa 



tctcttccct 
gattcgtgca 
tttcaagtca 
cagcagcagc 
aggcacattt 
aagtataagc 
cccagatgtg 
caccatgctt 
ggttgctctg 
gtctggctat 
ggcgtgtatg 
gacatattga 
attccactgg 
acaatcctgg 
ttgttgtatt 
tgaattatgc 



ctgctgggcg 60 
gatgcccatg 120 
acttcagatg 180 
cacacagtat 240 
cgggatcgga 300 
aaccctacca 360 
caccataata 420 
tcctctgtgg 480 
ctgctggtga 540 
aagaagtcat 600 
gcgaggcttt 660 
tgaaagtctg 720 
cagctaaagc 780 
agatcatcag 840 
tataaactgt 900 
tgtaattgaa 960 
970 



<210> 85 

<211> 594 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc__feature 
<223> Incyte ID No: 



5470806CB1 



<400> 85 

gacaggatgg 

gaagctggcc 

gctgtccgag 

ggaagctggg 

accacagatg 

tggaaggctc 

gactctggtc 

actgctgtcg 

ccaaagcagt 

caactcggga 



cttcccttcg 
ctgggggtgc 
gcagccctgc 
agccgtttgc 
agaagttcac 
ttggcatttc 
atcgccacta 
tcagtaaccc 
agcttcccat 
agatgccgtg 



cctgttcctc 
tggagaatcc 
tgttgatgtg 
ctctgggaag 
ggaaggggtg 
cccattccat 
caccatcgca 
ccagaactga 
ttgtactgaa 
aaacgatctt 



ctctgcctcg 
aagtgtcctc 
gccgtgaaag 
accgccgagt 
tacagggtag 
gaatacgcag 
gccctgctca 
gggacccagc 
acagtgttct 
attaaaccac 



ctgtactgat 
tgatggtcaa 
tgttcaaaaa 
ctggagagct 
aactggacac 
aggtggtttt 
gcccgtactc 
ccagtaggac 
tgctctataa 
ctgtgatgcc 



atttgcgtct 
agtcctggat 
gactgcagac 
gcacgggctc 
caaatcgtac 
cacagccaat 
ctacagcacc 
caggatcttg 
accgtgttag 
aaaa 
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180 

240 

300 

360 

420 

480 

540 

594 



<210> 86 

<211> 618 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 5473242CB1 



<400> 86 

gtgttgactc 

tactgttaat 

cagtctgctg 

ctctgtctct 

cgccttcgat 

tgaactccac 

gattgtgatt 

cttccagaag 

ttttcctgct 

ttgtcaaaat 

tgttctgttg 



gcaacctcag 
ggcctgtggg 
attgtctacc 
gctatcatgg 
gatggcctga 
tgtgacaagc 
atgatgggcc 
gtggtggctg 
cttgtctttg 
gacaaagacc 
agataccc 



gaacagacac 
gaaaggtgaa 
cttggaccca 
gtaaccccca 
aacacttgga 
tgcatgtgga 
accacctggg 
gagtggccag 
tgcaatggtc 
tttgaaaatc 



catggtgcac 
ccctgttgaa 
gaggtacttt 
ggtgaaggcc 
caacctcaag 
tcctgagaac 
caaggaattc 
tgccctggct 
aattgttccc 
tgtcctacta 



ctaactgatg 
attggcgctg 
tctaaatttg 
catggcgaaa 
ggcacctttg 
ttcaggctcc 
accccgagtg 
cacaagtacc 
aagagagctt 
attaaagcat 



ctgagaaggc 60 
agtcccttgc 120 
gggacctgtc 180 
aggtgataaa 240 
ccagcctcag 300 
tgggcaatat 360 
cacaggctgc 420 
actaaacctc 480 
ctgtcagttg 540 
ttggttcaag 600 
618 
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